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DESCRIPTION 

DNA Polymerases Having Modified Nuc!leot:idf> 

5 Binding Site for DNA SequeTir--i ng 

This application is a continuation-in-part of Tabor 
and Richardson, entitled "DNA POLYMERASES. HAVING MODIFIED 
NUCLEOTIDE BINDING SITE FOR DNA SEQUENCING, " f iled 
October 11, 1994, the whole of which (including drawings) 
is hereby incorporated by reference herein. 
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Background of the Invpnfinn 

This invention was made with government support 
15 including a grant from the U.S. Dept of Energy, contract 
number DE-FG02-88ER60688 . The U.S. government may have 
certain rights in the invention. 

This invention relates to DNA polymerases suitable 
for DNA sequencing and to automated and manual methods 
20 for DNA sequencing. 

The following is a brief description of art relevant 
to DNA sequencing techniques. This is provided only to 
give general guidance to those reading the application, 
and is not an admission that any art cited herein or 
25 referred to explicitly or implicitly is prior art to the 
appended claims. 

DNA sequencing generally involves the generation of 
four populations of single-stranded DNA fragments having 
one defined terminus and one variable terminus. The 
30 variable terminus generally teirminates at specific 
nucleotide bases (either guanine (G) , adenine (A) , 
thymine (T) , or cytosine (C) ) . The four different sets 
of fragments are each separated on the basis of their 
length, one procedure being on a high resolution 
35 polyacrylamide gel; each band on the gel corresponds 
colinearly to a specific nucleotide in the DNA sequence, 
thus identifying the positions in the sequence of the 
given nucleotide base. See Tabor and Richardson, U.S. 
Patents 4,942,130 and 4,962,020. 
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There are two general methods of DNA sequencing. 
One method (Maxam and Gilbert sequencing) involves the 
chemical degradation of isolated DNA fragments, each 
labeled with a single radiolabel at its defined terminus. 
5 each reaction yielding a limited cleavage specifically at 
one or more of the four bases (G, A, T or C) . The other 
method (dideoxy or chain- termination sequencing) involves 
the enzymatic synthesis of a DNA strand. Sanger et al . 
(Proc. Nat. Acad. Sci. USA 74:5463, 1977). Four 
separate syntheses are generally run, each reaction being 
caused to terminate at a specific base (G, A, T or C) via 
incorporation of an appropriate . chain terminating 
nucleotide, such as a dideoxynucleotide . The latter 
method is preferred since the DNA fragments can be 
uniformly labelled (instead of end labelled) by the 
inclusion of a radioactively labeled nucleoside 
triphosphate and thus the larger DNA fragments contain 
increasingly more radioactivity. Further, "S- labelled 
nucleotides can be used in place of "P-labelled 
20 nucleotides, resulting in sharper definition; the 
reaction products are easier to interpret since each lane 
corresponds only to either G, A, T or C. The enzymes 
used for most dideoxy sequencing include T7 DNA 
polymerase and DNA polymerases isolated from thermophilic 
25 organisms, such as Taq, Vent, Tth. and others. Other 
polymerases used to a lesser extent include AMV reverse 
transcriptase and Klenow fragment of B. coli DNA 
polymerase I. 

In the dideoxy chain terminating method a short 
single-stranded primer is annealed to a single-stranded 
template. The primer is elongated at its 3 '-end by the 
incorporation of deoxynucleotides (dNMPs) until a 
dideoxynucleotide (ddNMP) is incorporated. When a ddNMP 
is incorporated elongation ceases at that base. Other 
chain terminating agents can be used in place of a ddNTP 
and the ddNTP can be labelled as discussed below. 

Using the above methodology, automated systems for 
DNA sequence analysis have been developed. One 
instrument, which was manufactured by EG&G, makes use of 
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conventional dideoxy chain terminating reactions with a 
radioactively labeled nucleotide. The resulting DNA 
products are separated by gel electrophoresis. Toneguzzo 
et al , 6 Biotechnigues 460, 1988. A detector scans for 
5 radioactivity as it passes through the bottom of the gel. 
Four synthesis reactions are required for each template 
to be sequenced, as well as four lanes on each gel, a 
separate lane being used for products terminated by each 
specific chain terminating agent. 
10 Kambara et al , 6 Biotechnology 816, 1988, have used 

a fluorescent -labelled primer. The resulting 

fluorescently labelled products are excited with a laser 
at the bottom of the gel and the fluorescence detected 
with a CRT monitor. This procedure also requires four 
15 synthesis reactions and four lanes on the gel for each 
template to be secpaenced. 

Applied Biosystems manufactures an instrument in 
. which four different primers are used, each labelled with 
a different fluorescent marker. Smith et aJ . , 13 Nuc. 
20 Acid. Res. 2399, 1985; and 321 Nature 674, 1986. Each 
primer is used in a separate reaction containing one of 
four dideoxynucleotides . After the four reactions have 
been carried out the mixtures are combined and the DNA 
fragments are fractionated in a single lane on a gel. A 
25 laser, at the bottom of the gel is used to detect 
fluorescent products after they have been elect rophoresed 
through the gel. This system requires four separate 
annealing reactions and four separate synthesis reactions 
for each template, but only a single lane on the gel. 
30 Computer analysis of the sequence is made easier by 
having all four bands in a single lane. 

DuPont used to provide an instrument in which a 
different fluorescent marker was attached to each of four 
dideoxynucleoside triphosphates. Prober et al . , 238 
35 Science 336, 1987. A single annealing step, a single 
polymerase reaction (containing each of the four labelled 
dideoxynucleoside triphosphates) and a single lane in the 
sequencing gel are required. The four different 
fluorescent markers in the DNA products are detected 
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separately as they are electrophoresed through the gel . 

Englert et al . , U.S. Pat. No. 4,707,235 (1987), 
describes a multichannel electrophoresis apparatus having 
a detection means, disposed substantially across the 
5 whole width of the gel, that can detect labelled DNA 
products as they migrate past the detector means in four 
separate lanes, and identifies the channel or lane in 
which the sample is located. Preferably, radioisotopic 
labels are used. 

Inherent to procedures currently used for DNA 
sequence analysis is the necessity to separate either 
radioactively or fluorescently- labelled DNA products by 
a gel permeation procedure such as polyacrylamide gel 
electrophoresis, and then detect their locations relative 
15 to one another along the axis of movement through the 
gel.^ The accuracy of this procedure is determined in 
part by the uniformity of the signal in bands which have 
permeated approximately the same distance through the 
gel. Differences or variations in signal intensities 
20 between nearby bands create several problems. First, 
they decrease the sensitivity of the method, which is 
limited by the ability to detect the bands containing the 
weakest signals. Second, they create difficulties in 
determining whether a band with a weak signal is a true 
25 signal due to the incorporation of a chain terminating 
agent, or an artifact due to a pause site in the DNA 
where the polymerase has dissociated. Third, they 
decrease the accuracy in determining the DNA sequence 
between closely spaced bands since the strong signal of 
30 one band may mask the , weak signal of its neighbor. See 
Tabor and Richardson, supra\ 

Variation in band intensity can arise from an 
inherent property of most DNA polymerases. Most DNA 
polymerases discriminate against the chain terminating 
35 dideoxynucleotides used in DNA sequence analysis. T4 DNA 
polymerase discriminates against ddNTPs to such an extent 
that it cannot be used for DNA sequencing. E. coli DNA 
polymerase I, Taq, and Vent DNA polymerase also 
discriminate strongly against ddNTPs, each incorporating 
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a ddNMP a thousand times slower than the corresponding 
dNTP. Tabor and Richardson supra, (both hereby 
incorporated by reference herein) have shown that T7 DNA 
polymerase lies at the other end of the spectrum, 
5 discriminating against ddNTPs only several fold . If a 
DNA polymerase discriminated against a ddNTP to the same 
extent at all sequences, this problem could be overcome 
by simply altering the ratio of ddNTPs to dNTPs . Such an 
approach has been used with E. cold. DNA polymerase I and 
10 Tag DNA polymerase. However, the extent of 

discrimination varies with the adjacent DNA sequences, 
which leads to wide variation in the intensity of 
adjacent radioactive fragments . The intensity of 
specific fragments can vary by 50-fold for E. coli DNA 
15 polymerase I but only several fold for T7 DNA polymerase. 
Consequently, the intensity of bands on a DNA sequencing 
gel produced by T7 DNA polymerase are of similar 
intensity thus facilitating their detection and analysis 
by automated procedures. In addition, procedures that 
20 even further reduce the discrimination against 

dideoxynucleotides by T7 DNA polymerase are described 
such that it incorporates dideoxynucleotides equally as 
well as deoxynucleotides. These procedures and 

conditions also reduce but do not eliminate 
25 discrimination by other DNA polymerases such as Klenow 
and Taq DNA polymerases. For example, the use of 
manganese in place of, or in addition to, magnesium in 
the reaction mixture may reduce or eliminate 
discrimination against dideoxynucleotides. Under such 
30 conditions, T7 DNA polymerase does not differentiate 
between the two molecules whereas other DNA polymerases 
such as Klenow fragment, Taq and Vent still discriminate 
to some degree. For example, Klenow still discriminates 
against ddNTPs by as much as four- fold in the presence of 
35 manganese. More important, even though the overall 
degree of discrimination by such enzymes as Klenow and 
Taq DNA polymerases is reduced, the intensity of specific 
fragments can vary by much more than four fold due to 
high discrimination at certain sequences in DNA. These 
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polymerases and procedures are now almost universally 
used in manual DNA sequencing (i.e., without aid of 
sequencing machines such as described above) and are 
extensively used in automated methods. The use of 
5 manganese and the lack of discrimination against ddNTPs 
at all sites results in bands of uniform intensities, 
thus facilitating the reading of sequencing gels, either 
by manual or automated procedures. Moreover, the lack of 
discrimination enables the use of novel procedures for 
10 sequence analysis (Tabor and Richardson, supra). A 
method based on this finding is provided to determine a 
DNA sequence in a single reaction that contains all four 
ddNTPs at different ratios, by measuring the relative 
intensity of each peak after gel electrophoresis. The 
15 authors indicate: 

The DNA polymerases of this invention do not 
discriminate significantly between dideoxy- nucleotide 
analogs and normal nucleotides. That is, the chance of 
incorporation of an analog is approximately the same as 
that of a normal nucleotide or at least incorporates the 
analog with at least l/lO the efficiency that of a normal 
analog. The polymerases of this invention also do not 
discriminate significantly against some other analogs. 
This is important since, in addition to the four normal 
25 deoxynucleoside triphosphates (dGTP, dATP, dTTP and 
dCTP) , sequencing reactions require the incorporation of 
other types of nucleotide derivatives such as: 
radioactively- or fluorescently- labelled nucleoside 
triphosphates, usually for labeling the synthesized 
30 strands with "g^ 32p^ other chemical agents. When a 
DNA polymerase does not discriminate against analogs the 
same probability will exist for the incorporation of an 
analog as for a normal nucleotide. For labelled 
nucleoside triphosphates this is important in order to 
efficiently label the synthesized DNA strands using a 
minimum of radioactivity. [4,942,130, COL 5:5] 
They also state: 

The ability to produce nearby bands of approximately 
Che same intensity is useful sxnce it permits che 
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results of any sequencing reaction to be read more easily 
and v,ith greater certainty. Further, since the DNA 
products from a sequencing reaction with a specific chain 
terminating agent form bands which are of approximately 
5 the same intensity as that of nearby bands, band 
intensity itself provides a specific label for the series 
of bands so formed. The number of DNA products of 
approximately the same molecular weight produced by a 
given chain terminating agent varies depending upon the 
10 concentration of the chain terminating agent. Thus, by 
using a different concentration of each of the four chain 
terminating agents for the synthesis the DNA products 
incorporating one chain terminating agent are 
distinguished from DNA products of approximately the same 
15 molecular weight incorporating other chain terminating 
agents in that they differ in number or amount; 
consequently, the bands of DNA products can be identified 
as to chain terminating agent simply by their intensity 
as compared to the intensities of nearby bands. As a 
20 result, two or more series of DNA products, each series 
having a different chain terminating agent, can be 
subjected to gel permeation in a single lane and 
Identified, i.e., distinguished from each other, by the 
intensity of each band as compared to the intensity of 
25 nearby bands. Moreover, the syntheses of DNA products 
incorporating different chain terminating agents need not 
be carried out separately, in separate containers, but 
may all be carried out simultaneously in a single 
reaction vessel, and the same label, e.g., radioisotopic, 
fluorescent, etc. can, if desired, be used for all chain 
terminating agents instead of a different label for each, 
thus simplifying the procedure. [4,962,020, col 3:1-35] 
See also Tabor and Richatdson Proc. Natl, Acad. 
Sci. USA 86, 4076 -4080 (1989) which indicates that 
substitution of manganese ions for magnesium ions for 
catalysis by T7 DNA polymerase or E. coli DNA polymerase 
reduces the discrimination of these polymerases for 
ddNTPs by 4-100 fold, and Tabor and Richardson J. Bdol . 
Chem. 265, 8322-8328 (1990) which describes the use of 
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pyrophosphatase and manganese ions to generate dideoxy- 
terminated fragments of uniform intensity using T7 DNA 
polymerase . 

5 Summary of the Invent inn 

Applicant believes that the lesser utility of some 
DNA polymerases for dideoxy DNA sequencing is due in part 
to the reduced ability of those polymerases to 
incorporate a ddNMP (or other nucleotide analog) in place 
10 of a dNMP. As noted above, the ability not to 
discriminate allows use of lower concentrations of ddNTPs 
than with enzymes that do discriminate, and most 
important it provides banding patterns in sequencing gels 
that are of more uniform intensity along their length. 
15 Both these results make automated sequencing with the 
enzyme easier and more profitable in that longer DNA 
sequences can be determined with greater confidence. 

The present invention provides a method by which an 
otherwise discriminatory DNA polymerase can be altered to 
20 discriminate less against ddNTPs than the corresponding 
naturally occurring enzyme. This method involves the 
modification of the polymerase genetically to provide 
amino acid residues at key locations that enhance the 
ability of the polymerase to incorporate analogs of 
25 dNMPs. Applicant has determined that amino acid changes 
in a specific region of DNA polymerases have a dramatic 
effect on the ability of those DNA polymerases to 
incorporate dideoxynucleotides ; the specific amino acid 
residue inserted determines whether the polymerase is 
30 more or less discriminatory towards ddNTPs. Applicant 
has determined that modifying DNA polymerases so that 
they incorporate dideoxynucleotides more efficiently has 
a tremendous effect on their utility in DNA sequencing. 
It is possible that such modified DNA polymerases will 
35 also prove useful for other common molecular biology 
procedures such as the amplification of DNA (for example 
by the polymerase chain reaction), in vitro mutagenesis 
and filling in the ends of DNA fragments. The 
combination of this technology with existing knowledge 
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for alteration of 3 '-5' exonuclease activity of a DNA 
polymerase (such as T7 DNA polymerase as described by 
Tabor and Richardson, supra) , and the use of manganese 
and pyrophosphatase in sequencing reactions will allow 
5 the production of significantly superior enzymes to those 
presently known. Modification of the DNA polymerase is 
preferably by substitution of one or more amino acids but 
may be by the insertion of one or more additional amino 
acids or the deletion of one or more amino acids. 

10 In one particular aspect, namely, DNA sequencing, 

thermophilic enzymes having an ability to catalyze the 
polymerization of nucleotides at temperatures above 50**C 
and, in particular, above 60**C, 70**C, or even 80°C, under 
conditions used for DNA sequencing, are well known in the 

15 art. Such enzymes are generally present in organisms 
that grow at these temperatures. However, applicant 
believes that many of these enzymes suffer from 
limitations, including the limited ability to incorporate 
dideoxynucleotides . By modification of these enzymes 

20 using methods shown below, those in the art can now 
modify any desired thermophilic DNA polymerase to make it 
incorporate dideoxynucleotides more efficiently. Such 
enzymes will be superior to those existing in the present 
day for DNA sequencing both in automated machines and in 

25 manual sequencing, especially in procedures known as 
cycle sequencing. In cycle sequencing, multiple rounds 
of DNA synthesis are carried out from the same template, 
with the synthesized strand removed after each cycle by 
heat denaturation; this allows much smaller amounts of 

30 DNA template to be used in a sequencing reaction 

Applicant has determined experimentally that amino 
acid residue 526 in the relatively non-discriminating 
enzyme T7 DNA polymerase provides it with this property. 
Applicant has determined that- by modification of residue 

35 526 it is possible to increase the ability of T7 DNA 
polymerase to discriminate many fold. Based on the amino 
acid homologies between T7 DNA polymerase and other DNA 
polymerases, applicant has determined that altering the 
residue at the homologous site in other DNA polymerases 
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likewise affects their ability to discriminate against 
dideoxnucleotides . Examples of such homologous sites are 
residue 762 of E. coli DNA polymerase I and residue 667 
of Taq DNA polymerase. In all three of these examples, 
5 applicant has shown that it is important that the residue 
at this site is different from phenylalanine (F) . e.g., 
it may be tyrosine (Y) as in T7 DNA polymerase . 
Surprisingly, modification of this single amino acid 
residue, even by the addition of a single hydroxyl group, 
0 provides a very large alteration (250 - 8,000 fold) in 
discrimination levels. Those in the art will recognize 
that changes at this one site are not limiting in this 
invention, and changes at other sites that decrease the 
ability of a polymerase to discriminate can now be 
readily found by routine experimentation. For example, 
applicant has found that modification of T7 DNA 
polymerase at 13 other sites also results in an 
increased ability of the enzyme to discriminate against 
ddNTPs, however the effect of alterations at these sites 
is much less, only 5- 20 fold. By use of analogous 
procedures, other sites that effect discrimination 
against ddNTPs can be readily identified in other DNA 
polymerase to make them more useful for DNA sequencing. 
Such other sites include amino acid residues in regions 
highly homologous between E. coll DNA polymerase I and 
T7 DNA polymerase, since these regions are likely to make 
up in part the binding domain for ddNTPs; in E. coli DNA 
polymerase I, these regions include amino acids in 
regions analogous to those in T7 DNA polymerase from 
conserved or non-conserved amino acids in regions 665 - 
6 81 and 754 - 783, and possibly in regions 709 - 734, 
797 - 866, and 913 - 927. The amino acid change to 
provide the desired function can be chosen to be 
identical to the corresponding ammo acids of a non- 
discriminatory enzyme like T7 DNA polymerase, or other 
functionally equivalent amino acids which can be chosen 
by routine experimentation. By changing non-conserved 
amino acids more profound alteration of the ability to 
discrimination is obtained. Non-conserved amino acids 

SUBSTITUTE SHEET (RULE 26) 



10 



15 



WO 96/12042 PCT/US95/12928 

11 

are those which vary from one species of polymerase to 
another (i.e., are found in less than 50% of 
polymerases) . The term "analogous" is used in its 
commonly recognized manner. Thus, an analogue of a Pol 
5 I polymerase is one having an amino acid sequence as 
described by Braithwaite and Ito, infra, which is 
preferably as related to the other members of the Pol I 
family of polymerases described therein as is Spo2 DNA 
polymerase. Such analyses can be performed using 
Felsenstein's PHYLIP program. Id. 

Thus, in a first aspect the invention features a 
modified gene encoding a modified DNA polymerase. The 
gene is modified to produce a modified DNA polymerase 
that, compared to the corresponding naturally-occurring 
or unmodified DNA polymerase, has an increased ability to 
incorporate a dideoxynucleotide compared to a 
deoxynucleotide . 

By "increased ability" is meant that the DNA 
polymerase is able to better incorporate a 
dideoxynucleotide. That is, it discriminates to a lesser 
extent than a corresponding naturally-occurring DNA 
polymerase against a dideoxynucleotide compared to a 
deoxynucleotide. Specific methods for measuring such 
discrimination are provided below. The term "increased" 
means to provide a measurable difference in ability to 
incorporate such dideoxynucleotides . In preferred 
embodiments, this is a increase of at least 10% compared 
to the naturally -occurring enzyme, although it is 
preferred that the level of discrimination against a 
dideoxynucleotide is reduced by at least 10 to 100 fold 
and preferably by 100 - 500 fold. One example of such an 
enzyme is E. coli DNA polymerase I which (as noted 
herein) discriminates approximately 140 - 1,100 fold 
against incorporation of dideoxynucleotides compared to 
35 deoxynucleotides. By the method of this invention an 
enzyme can be derived (by alteration of only one or two 
amino acids) that actually prefers ddNTPs over dNTPS - 
that is, the ability of the polymerase to incorporate 
dideoxynucleotides has been increased by an average of 
40. 1,000 fold. 
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The phrase V corresponding naturally-occurring DNA 
polymerase" is one well known in the art and refers to 
the polymerase found in nature, and which preferably is 
not altered either by in vitro or in vivo manipulations 
5 in the laboratory. Similarly the corresponding nucleic 
acid is that nucleic acid encoding a DNA polymerase found 
in nature. This is simply used as a base line to compare 
modified nucleic acids encoding such polymerases. Thus, 
a base line for the DNA polymerase of rhermus aguaticus 
10 (also termed "Tag") is the nucleic acid which naturally 
encodes Tag DNA polymerase present in the bacterium 
rhermus aguaticus. Applicant provides at least one site 
which can be changed in such polymerases to alter the 
ability of the polymerase to incorporate a 
15 dideoxynucleotide . These sites are merely examples and 
are not limiting in this invention since those in the art 
armed with the knowledge that the ability of a DNA 
polymerase can be usefully altered in this property are 
now provided with the methodology by which to alter such 
20 enzymes either at these specific sites or at other 
egiiivalent sites. 

In embodiments of the invention described herein, 
modification of the DNA polymerase is by sxibstitution of 
one or more amino acids. It may however be found that 
25 the modification may take the form of the insertion of 
one or more additional amino acids or the deletion of one 
or more amino acids. 

DNA polymerases of this invention may also be 
modified to remove or alter an exonuclease domain, such 
30 as the 3' - 5' exonuclease activity described by Tabor 
and Richardson, supra, or the 5' - 3' exonuclease 
activity in Tag described by Barnes (WO 92/06188) . The 
mutations that alter the ability of DNA polymerases of 
this invention to discriminate against ddNTPs preferably 
35 do not affect the exonuclease activity substantially; by 
this it is meant that the mutations are in the polymerase 
domain of the enzyme, near the active site for 
polymerization, and are not decreasing discrimination 
merely by reducing the ability of the polymerase to 
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remove incorporated analogs via its exonuclease activity. 
Particularly suitable DNA polymerases of this invention 
are Pol I - type polymerases as described by Braithwaite 
and Ito, 21 Nuc. Acid. Res, 767, 1993, hereby 
5 incorporated by reference herein, and referred to as 
Family A, and polymerase alpha or polymerase II - type 
DNA polymerases described by Braithwaite and Ito, and 
referred to as family B. The other polymerase families 
described by Braithwaite and Ito may also be used in this 
10 invention. In particular, applicant has found that the 
presence of a polar, hydroxyl containing amino acid 
residue at a position near the binding site for the dNTP 
substrate is important for, the polymerase being able to 
efficiently incorporate a dideoxynucleotide . Without 
15 being bound by any theory. Applicant believes that this 
finding is contrary to the expected result that high 
discrimination against a nucleotide without a hydroxyl 
group at the 3' position of the ribose moiety (i.e. a 
ddNTP) requires the simultaneous absence of a hydroxyl 
20 group on the amino acid residue at this critical site. 
In other words, the presence of the gap, or hole created 
by the absence of both hydroxyl groups leads to 
discrimination against the analog. Knowledge of this 
result provides an approach to finding the critical 
25 residue in even distantly related DNA polymerases; the 
addition of a residue with a polar group for a non polar 
one in the region where the dNTP binds is a useful 
candidate amino acid change for decreasing the ability of 
the polymerase to discriminate against ddNTPs . For 
30 example, the phenylalanine at position 272 of rat DNA 
polymerase b, a DNA polymerase with little if any 
homology to polymerases of Family A or B, has been shown 
by X-ray diffraction studies to be in contact with the 3' 
position of the ddCTP residue in a ternary complex with 
35 a primer- template {Pelletier et al . , 264 Science 189, 
1994). Knowledge of the results described in this 
invention make the modification of this residue to 
tyrosine a logical choice in a screen of mutants of rat 
DNA polymerase b that incorporate dideoxynucleotides more 
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efficiently. . Those in the art will thus be likely to be 
able to alter the discriminatory phenotype of any DNA 
polymerase using the information provided herein. 

The ability of some polymerases of this invention to 
5 incorporate dideoxynucleotides more efficiently can be 
specific (i.e., the effect on dideoxynucleotide analogs 
is much greater than on other analogs) . However, some of 
the polymerases are also useful to aid in incorporation 
of other base modified analogs (e.g., deoxyinosine 
10 triphosphate (dITP) and 2 • -deoxy-V-deazaguanosine 5'- 
triphosphate (dc'^GTP) to remove compression of bands 
during electrophoresis, and f luorescently labeled 
deoxynucleotides or dideoxynucleotides for use in 
automated procedures) . In addition, such polymerases may 
15 be able to incorporate ribonucleotides more efficiently 
thus allowing synthesis of RNA without need for a 
promoter. Specifically, the conserved motifs between 
single -subunit DNA-dependent RNA polymerases such as T7 
RNA polymerase and DNA polymerases of Family A (Pol I - 
20 type DNA polymerase) suggest that mutations in this 
region (residues 758 to 767 of E. col± DNA polymerase I) 
are likely to change the specif icity towards rNTPs. This 
permits the engineering of RNA polymerases that 
efficiently initiated synthesis from a primer, 
25 eliminating the . requirement for a promoter for the 
synthesis of RNA. Analogously, the data provided herein 
suggest that modifying residues 631 to 64 0 of T7 RNA 
polymerase will alter its specificity towards dNTPs . 
This permits the engineering of a new DNA polymerase that 
30 initiates DNA synthesis de novo from a promoter 
sequence, and cannot use a primer. 

In preferred embodiments, the modified DNA 
polymerase has sufficient DNA polymerase activity (e.g., 
at least that used in a standard sequencing reaction, and 
35 preferably at least 100 units/mg of enzyme as defined in 
the art; preferably the mutation in the polymerase does 
not alter the prior level by more than 5-10 fold) for use 
in DNA sequencing (when combined with any host factor 
necessary for that DNA polymerase activity) ; and has 
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sufficiently low. exonuc lease activity (e.g., less than 
5 00 units/mg, see Tabor and Richardson, supra) to allow 
the polymerase to be used in DNA sequencing; the DNA 
polymerase has one or more of the amino acids at the 
5 dideoxynuclebtide binding site of a T7-type DNA 
polymerase (e.g., one selected from the group consisting 
of T7, T3, 01, 011 H, W31,. gh-1, Y, A1122, and SP6). 
Preferably the modified DNA polymerase is modified from 
a thermostable enzyme, such as the DNA polymerase encoded 
10 by Thermus aquaticus, Thermus thermophilus , Theirmus 
flavus. Bacillus sterothermophilus, and Vent bacteria; 
and the ability of the polymerase to incorporate a 
dideoxynucleotide is increased at least 10-fold, 50 -fold 
or most preferably at least 100-fold compared to the 
15 corresponding naturally-occurring DNA polymerasei, e.g., 
by a change at just one amino acid. 

In a second aspect, the invention features a method 
for production of a modified DNA polymerase having an 
increased ability to incorporate a dideoxynucleotide 
20 compared to the ability of a corresponding naturally- 
occurring DNA polymerase . The method includes providing 
a nucleic acid molecule encoding a DNA polymerase and 
mutagenizing or otherwise altering the nucleotide base 
sequence of the nucleic acid molecule to incorporate one 
25 or more base changes in the nucleotide base sequence at 
one or more sites which significantly (i.e., at least 10, 
50 or most preferably 100-500 fold) alter the ability of 
the polymerase encoded by the nucleic acid to incorporate 
a dideoxynucleotide. 
^0 In a third aspect, the invention features a method 

for determining the nucleotide base sequence of a DNA 
molecule. The method includes providing a DNA molecule 
annealed with a primer molecule able no hybridize to the 
DNA molecule; and incubating the annealed molecules in a 
5 vessel containing ac least one deoxynucleotide 
triphosphate, a DNA polymerase modified from a naturally- 
occurring DNA polymerase to have an increased ability to 
incorporate a dideoxynucleotide compared to the 
naturally- occurring polymerase . (The polymerase has 
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sufficient DNA polymerase activity and sufficiently low 
exonuclease activity to be useful for DNA sequencing.) 
Also provided is at least one DNA synthesis terminating 
agent which terminates DNA synthesis at a specific 
5 nucleotide base. The method further includes separating 
the DNA products of the incubating reaction according to 
size, whereby at least a part of the nucleotide base 
sequence of the DNA molecule can be determined. 

In preferred embodiments, the DNA polymerase is a 
10 thermostable DNA polymerase and the sequencing is 
performed at a temperature above 50*^0, 60°C, or 70*^0, and 
the DNA polymerase is derived (i.e., has at least 50% 
identity in. amino acid residues) from one encoded by 
Therznus aguaticus, Thermus thermophilus, Thexmus flavus, 
15 Bacillus sterotherntophllus , Theinnococcus litoralis 
(Vent) , Pyrococcus furiosus (Pfu) or Sulfolobus 
solfataricus. 

in other preferred embodiments, the DNA polymerase 
has less than 1000, 250, 100, 50, 10 or even 2 units of 
20 exonuclease activity per mg of polymerase and is able to 
utilize primers having only 4, 6 or 10 bases; and the 
concentration of all four deoxynucleoside triphosphates 
at the start of the incubating step is sufficient to 
allow DNA synthesis to continue until terminated by the 
25 agent , e • g . , a ddNTP . 

For cycle sequencing, the polymerases of the present 
invention now make it possible to use significantly lower 
amounts of dideoxynucleot ides compared to other enzymes. 
That is, the method involves providing an excess amount 
30 of deoxynucleotides to all four dideoxynucleotides in a 
cycle sequencing reaction, and performing the cycle 
sequencing reaction. For other enzymes, it was necassary 
to add an excess of at least one of the ddNTPs to such 
reactions. For example. Sears et aJ . , 13 BioTechniques 
35 626, 1992 describe use of about a 10 fold excess of 
ddNTPs to dNTPs with Vent polymerase, and Carothers et 
ai., 7 BioTechniques 494, 1989 describe use of at least 
2 fold excess of ddNTPs to dNTPs for Taq polymerase. In 
the present invention, such excess is non needed. 
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Preferably, more than 2, 5, or even 10 fold excess of a 
dNTP is provided to the corresponding. ddNTP. In a 
specific example, less than 10 M ddNTP is used with a 
modified Taq of this invention. 
5 In a related aspect, the invention features a kit or 

solution for DNA sequencing including a modified DNA 
polymerase as described above and a reagent necessary for 
the sequencing selected from the group consisting of 
dITP,. deaza GTP, a chain terminating agent such as a 
ddNTP, and a manganese -containing solution or powder. 

In another aspect, the invention features a method 
for providing a modified DNA polymerase having an 
increased ability compared to the corresponding 
naturally-occurring DNA polymerase to incorporate a 
dideoxynucleotide by providing a nucleic acid sequence 
encoding the modified DNA polymerase, expressing the 
nucleic acid within a host cell, and purifying the DNA 
polymerase from the host cell. 

In another related aspect , the invention features a 
method for sequencing a strand of DNA essentially as 
described above with one or more (preferably 2, 3 or 4) 
deoxyribonucleoside triphosphates, a DNA polymerase as 
described above, and a first chain terminating agent. 
The DNA polymerase causes the primer to be elongated to 
25 form a first series of first DNA products differing in 
the length of the elongated primer, each first DNA 
product having a chain terminating agent at its elongated 
end, and the number of molecules of each first DNA 
products being approximately the same for substantially 
all DNA products differing in length by no more than 20 
bases. The method also features providing a second chain 
terminating agent in the hybridized mixture at a 
concentration different from the first chain terminating 
agent, wherein the DNA polymerase causes production of a 
35 second series of second DNA products differing in the 
length of the elongated primer, with each second DNA 
product having the second chain terminating agent at its 
elongated end. The number of molecules of each second 
DNA product is approximately the same for substantially 

SUBSTIfUrE SHEET (RULE 26) 



IS 



^^'''''"(MZ PCr/US95/12928 

18 

all second DNA products differing in length from each 
other by from 1 to 20 bases, and is distinctly different 
from the number of molecules of all the first DNA 
products having a length differing by no more than 20 
5 bases from that of said second DNA products. 

In preferred embodiments, three or four such chain 
terminating agents can be used to make different products 
as described in Tabor and Richardson, supra; and the 
sequencing reaction is provided with a magnesium ion, or 
10 even a manganese or iron ion (e.gr., at a concentration 
between 0.05 and lOO mM, preferably between l and 10 mM) ; 
and the DNA products are separated according to molecular 
weight in less than four lanes of a gel. 

In another related aspect, the invention features a 
method for sequencing a nucleic acid by combining an 
oligonucleotide primer, a nucleic acid to be sequenced, 
between one and four deoxyribonucleoside triphosphates, 
a polymerase as described above, and at least two chain 
terminating agents in different amounts, under conditions 
favoring extension of the oligonucleotide primer to form 
nucleic acid fragments complementary to the nucleic acid 
to be sequenced. The method further includes separating 
the nucleic acid fragments by size and determining the 
nucleic acid sequence. The agents are differentiated 
from each other by intensity of a label in the primer 
extension products . 

While it is common to use gel electrophoresis to 
separate DNA products of a DNA sequencing reaction, those 
in the art will recognize that other methods may also be 
30 used. Thus, it is possible to detect each of the 
different fragments using procedures such as time of 
flight mass spectrometry, electron microscopy, and single 
molecule detection methods. 

The invention also features an automated DNA 
sequencing apparatus having a reactor including reagents 
which provide at least two series of DNA products formed 
from a single primer and a DNA strand. Each DNA product 
of a series differs in molecular weight and has a chain 
terminating agent at one end. The reagents include a DNA 
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polymerase as described above . The apparatus includes a 
separating means for separating the DNA product along one 
axis of the separator to form a series of bands. It also 
includes a band reading means for determining the 
5 position and intensity of each band after separation 
along the axis, and a computing means that determines the 
DNA sequence of the DNA strand solely from the position 
and intensity of the bands along the axis and not from 
the wavelength of emission of light from any label that 
10 may be present in the separating means. 

In other aspects, the invention features: (a) a 
method for in vitro mutagenesis of a cloned DNA fragment 
by providing the cloned fragment and a DNA polymerase 
described above, contacting the cloned fragment with the 
15 polymerase under conditions for synthesizing a DNA strand 
from the fragment. The conditions cause formation of the 
DNA strand by incorporation of a plurality of individual 
contiguous bases able to base -pair with the fragment and 
incorporation of a nucleotide base unable to base pair 
20 with the fragment; (b) a method for in vitro mutagenesis 
of a template DNA fragment by providing a primer and 
template, the primer having contiguous bases able to 
base-pair with contiguous bases of the template, except 
at least one base which is unable to base -pair with the 
25 template. The method involves extending the primer with 
a DNA polymerase as described above; (c) a method for 
producing blunt -ended double -stranded DNA from a linear 
DNA molecule having a 5' end having a single-stranded 
region. The 3* end of the molecule is double -stranded 
30 and has no 3' protruding termini. The method includes 
incubating the DNA molecule with a DNA polymerase as . 
described above which acts on the single-stranded region 
to produce a blunt -ended double -stranded DNA molecule; 
(d) a method for labeling the 3 • end of a DNA fragment by 
35 incubating the DNA fragment with a DNA polymerase as 
described above, and a labelled deoxynucleotide species 
under conditions in which the 3 ' end of the DNA fragment 
is extended by the polymerase and thereby labelled by 
addition of the labelled deoxynucleotide to the DNA 
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fragment; (e) a method of amplification of a DNA secjuence 
by annealing a first and second primer to opposite 
strands of a double- stranded DNA sequence and incubating 
the annealed mixture with a DNA polymerase as described 
5 above. The first and second primers anneal to opposite 
strands of the DNA sequence with their 3' ends directed 
towards- each other after annealing, and with the DNA 
sequence to be amplified located between the two annealed 
primers . 

10 In yet other aspects, the invention features 

specific DNA polymerases, such asThermus aqua ticus DNA 
polymerase having a tyrosine at residue 667, E. coli DNA 
polymerase I having a tyrosine at residue 762, and any 
Pol I type DNA polymerases having a tyrosine residue at 
15 the analogous location to E. coli DNA polymerase residue 
762, e.g., at the N^ position of the amino acid sequence 
K Ni Nj Ng N4 N5 Nfi N7 Y G, wherein each N is independently 
any amino acid. Furthermore, this invention features 
specific polymerases of the DNA polymerase alpha family 
20 having the sequence K N^ Nj N3 N4 N5 N^ Y G/Q, wherein each 
N is independently any amino acid, and where one of the 
residues Ni to N7 has been mutated to produce a polymerase 
that has reduced discrimination against ddNTPs 
(preferably reduced by at least 2 0- fold compared to the 
25 non-mutated sequence) . The invention also features 
nucleic acid encoding any of these DNA polymerases. 

In related aspects, the invention features DNA 
polymerases except reverse transcriptase which in the 
presence of magnesium as the only added divalent cation 
30 have an average processivity of less than 100 and 
discriminate less than 100 times against incorporation of 
a ddNMP compared to a dNMP, or which in the presence of 
magnesium as the only added divalent cation have an 
average processivity of less than 50 and discriminate 
35 less than 50 or 5 times against incorporation of a ddNMP 
compared to a dNMP. Those in the art will recognize that 
processivity can be measured by any standard procedure 
that will indicate that the average processivity of T7 
DNA polymerase is at least 500, that of Klenow fragment 
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is about 4 - 40, and for reverse transcriptase it is 
about 150 - 200, Such measurements can be performed as 
described by Tabor et al . , J. Biol. Chem. 262: 16212, 
1987, hereby incorporated by reference herein. The 
5 average processivity of Taq DNA polymerase under these 
conditions is expected to be less than 100. 

In particularly preferred aspects, the invention 
features thermophilic DNA polymerases that discriminate, 
e.g., in the presence of magnesium, against a ddNMP 
10 compared to a dNMP by less than a factor of 100, and 
which preferably have an average processivity less than 
100, and cycle from one primer-template to another more 
than once per one or even ten seconds. Such cycling can 
be measured by standard procedures. 
15 The invention also features a method for cycle 

sequencing using a DNA polymerase as described above, and 
also features cellular {as opposed to viral or 
mitochondrial) DNA polymerases having a tyrosine in place 
of the naturally occurring amino acid at a location which 
20 causes the polymerase not to discriminate against a ddNMP 
compared to a dNMP by more than 50 fold. 

In other aspects, substitution of the amino acid at 
the noted sites will result in alteration of other 
properties of the corresponding natural polymerase. In 
25 addition, polymerases of this invention may be combined 
with other polymerases in the methods described herein to 
take advantage of the superior properties of each 
polymerase in the mixture. 

Other features and advantages of the invention will 
30 be apparent from the following description of the 
preferred embodiments thereof , and from the claims. 
Description pf the Preferred Embgdiments 

The drawings will first briefly be described. 

35 Drawings 

FIG. 1 is a diagrammatic representation of the amino 
acid sequence of the DNA polymerase encodecJ by gene 5 of 
bacteriophage T7 indicating the palm and finger domains, 
and the location . of various dideoxy resistant (DR) 
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mutants, the location of regions labelled A-E, and the 
location of one site involved in ddNTP discrimination; 

FIG. 2 is a three dimensional representation of the 
structure of DNA polymerase I showing the locations of 
5 regions A-E; 

FIG. 3 is a diagrammatic representation of a 
riboselectivity region of pol I-type DNA polymerases with 
amino acids indicated by the universal single letter 
code. The initial amino acid number is indicated on the 

0 left of the figure and the amount of discrimination 
against dideoxynucleotides compared to deoxynucleotides 
is indicated on the right; 

FIGS. 4, 5, and 6 are diagrammatic representations 
showing modifications of the riboselectivity region of E. 

5 coll DNA polymerase I, T7 DNA polymerase, and Taq DNA 
polymerase, respectively. 

Dideoxv Resist^ant^ Mu^anhg 

The following is a brief discussion of publications 
► of some relevance. None are admitted to be prior art to 
the pending claims but are provided to aid in 
understanding the present invention. 

Reha-Krantz et al . , Mutational Analysis of 
Bacteriophage T4 DNA Polymerase, from Abstracts for 
Poster Presentations, presented at a meeting entitled 
"The Fidelity of DNA Synthesis: Structural and 
Mechanistic. Perspectives, Beaufort, North Carolina, 
September 24-29, 1989 describe C- terminal mutants with 
increased utilization of ddNTPs. However, Reha-Krantz et 
ai., J. Virology 67, 60-66 (1993), indicate that while 
the Ki for ddGTP was 50 times lower with the mutant L412M 
compared to wild-type T4 DNA polymerase, no difference in 
efficiency of incorporation of ddGTP was found between 
the mutant and wild-type T4 DNA polymerase. On page 63, 
it states that "Despite the sensitivity of the L412M DNA 
polymerase zo ddGTP, there was no difference found in the 
incorporation of ddNTPs by wild-cype and L412M DNA 
polymerases". It also states "it does not appear that 
any single region is the sole binding site for either PPi 
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or nucleotides". In addition, Reha-Krantz and Nonay, J", 
flioi. Chem. 269, 5635-5643 (1994), provide a study of the 
mutant L412M and other mutant T4 DNA polymerases. 

Gibbs et al., Proc. Natl. Acad. Sci. USA 85, 6672- 
5 6676 (1988) and Larder et al . , the EMBO Journal 6, 169- 
175 (1987) describe the spectrum of mutations obtained in 
Herpes DNA polymerase when selected for resistance to a 
number of nucleotide analogs: pyrophosphate, 
phosphonoacetic acid, and phosphonof ormic acid, 
10 acyclovir. vidarabine. ganciclovir and 

bromovinyldeoxyuridine. It indicates that many of the 
mutants resistant to one drug are also resistant to other 
drugs, even when they are analogs to different regions of 
the sxibstrate . 

15 Derse et al., j. Biol. Chem. 257, 10251-10260 

(1982). describe five classes of mutants in Herpes 
Simplex DNA polymerase isolated by selection for growth 
in the presence of phosphonof ormic acid, a pyrophosphate 
inhibitor. For mutants in each class, they compare 
20 resistance to ddGTP (page 10256, Table III). All 
increase the Ki for ddGTP by 20 to 100 fold. 

Prasad et al., Proc. Natl. Acad. Sci. USA 88, 11363- 
11367 (1991) use a direct screening strategy and show 
that a single mutation in HIV reverse transcriptase (a 
change of Glu 89 to Glycine) renders the polymerase more 
resistant to ddGTP (requires about 10 times more ddGTP to 
obtain same extent of inhibition) . This mutation confers 
a broad resistance to a number of analogs, including 
phosphonof ormic acid, a pyrophosphate analog. While the 
mutant was equally resistant to ddTTP, ddCTP and ddGTP, 
it was much less resistant to ddATP. 

Song et al . , J. Virol. 66, 7568-7571 (1992) mutate 
glu- 8 9 of human immunodeficiency virus type 1 reverse 
transcriptase to 9 different amino acid residues, and 
measure the resistance of each mutant enzyme to ddGTP and 
phosphonof ormic acid, a pyrophosphate analog. The 
mutations fell into two classes; replacement of Glu- 8 9 
with alanine, glycine, valine or threonine resulted in 
enzymes highly resiscanc to both ddGTP and 
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phosphonof ormic acid compared co the wild- type enzyme, 
while mutation to serine, glutamine, asparagine, aspartic 
acid, and lysine resulted in enzymes with only moderate 
or no resistance to ddGTP. None of the mutants made the 
5 enzyme less resistant to ddGTP than the wild-type enzyme 
(Table I, page 7569) . The authors speculate that the 
89th and 90th residues of reverse transcriptase form a 
portion of the dNTP-binding pocket based on their results 
and the- crystal structure of reverse transcriptase. 
^0 Papers concerned with the properties of E. coli DNA 

polymerase I mutant proteins with mutations in the 
vicinity of the mutation that result in ddNTP selectivity 
including the following: 

Carroll et al . , Biochemistry 20, 804-813 (1991) 
15 study two mutants: Tyr766Ser and Tyr766Phe for 

misincorporation of normal deoxynucleotides . Polesky et 
al,, J. Biol.Chem. 265, 14579-14591 (1990), characterizes 
mutations that have two different properties: (1) 
Tyrosine 766, arginine 841, and asparagine 845; which the 
20 authors suggest that these residues contact the incoming 
dNTP. (2) Glutamine 849, arginine 668, and aspartic acid 
882, which the authors suggest are involved in catalysis. 
Polesky et ai . , J. Biol. Chem. 267:8417 (1992), further 
characterizes mutations in arginine 668, glutamine 849, 
25 and aspartic acid 882, and also mutations at aspartic 
acid 705 and. glutamic acid 710. In this study the 
authors look at the incorporation of alpha -thio- 
substituted dNTPs, i.e. analogs in the phosphate moiety, 
Pandey et a J . , J. Biol. Chew. 269, 13259-13265 (1994) look 
30. at two mutants in E. coli DNA polymerase I that change 
lysine 758 to alanine and arginine. The authors indicate 
that Basu et ai . , Biochemistry 26, 1704-1709 (1987) 
implicate the same lysine 758 in dNTP binding. This was 
shown chemically; DNA polymerase I was covalently 
35 modified using pyridoxal 5 ' -phosphate , a nucleotide 
analog, and lysine 758 was said to be the residue 
modified. 

Beese et al . , Biochemistry 321: 14095-14101 (1993) 
describe the structure of a cocrystal of Klenow fragment 
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of DNA polymerase I complexed with a dNTP or with 
pyrophosphate. The authors state that the dNTP binds 
adjacent to helix O. The authors make the following 
statements: (a) "In the Mg-dCTP complex, cytosine 
5, interacts with His. 881, while the sugar interacts with 
Phe 764 [sic, 762] (Figures 3 and 5) (b) "However, we 
conclude that the position of at least the dNMP moiety of 
dNTP in the binary complex is not likely to be the same 
as in its catalytically relevant complex with primer- 
10 template DNA." (c) "Since the entire binding site for 
the base of dNTP is formed by its Watson-Crick hydrogen 
bonding to the template strand and its stacking on the 3 ' 
base of the primer strand, it is not unlikely that the 
binding site for the base in the binary complex is 
15 completely adventitious, consistent with our observation 
that it can bind in several locations dependent on 
conditions." (d) "The binding site for dNTP observed in 
crystals of the binary complex is the same as observed in 
solution studies. However, extrapolating from this 
20 binary complex to a model for the complex with dNTP in 
the presence of primer and template DNA requires 
considerable caution. We presume that the sugar and base 
moiety of the dNTP require primer- template DNA to bind in 
correct conformation." 
25 Joyce and Steitz, 63 Ann. Rev. Bioc. Ill , 1994 (not 

admitted to be prior art to the present invention) 
discuss the relationship of various DNA and RNA 
polymerases. It indicates three functions for the "palm" 
(rather than the **finger") subdomain of DNA polymerase I 
30 - namely, the catalytic center, the binding site for the 
3' terminus of the primer, and the dNTP binding site. In 
HIV-1 reverse transcriptase it indicates that mutations 
that influence binding of DNA polymerase inhibitors are 
around residues 67 - 70. It also states that " [a] Ithough 
35 no useful conclusions could be drawn from the positions 
of the nucleotide base of sugar, ic is possible that the 
crystalline binary complex may be informative in 
identifying contacts between Klenow fragment and the dNTP 
phosphate groups." In the preceding paragraph, it states 
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"although a polymerase -dNTP binary complex can be formed, 
such a complex is not catalytically competent." It 
further indicates that data "would place the deoxyribose 
close to Phe 762" and that ^'Mutation of Tyr 766 [in 
5 Klenow fragment helix O] , which is located in the fingers 
domain in the vicinity of the model -built template 
strand, affects the discrimination between deoxy and 
dideoxy nucleotide substrates . , However, it also 

states **In Klenow fragment, mutations that have been 
10 found to affect the binding of dNTP in the ternary 
complex (as reflected in K^iomv)) are located on one side 
of the polymerase cleft within or close to the fingers 
subdomain. Positions identified thus far encompass the 
N terminus of helix Q (Arg 841 and Asn 84 5) , the exposed 
15 face of helix O (Tyr 766, Phe 762, and Arg 754), and 
neighboring residues closer to the catalytic center (Asp 
705 and Glu 710) ... An advantage of the kinetic 
approach is that it probes the ternary complex; however, 
as discussed above, it is impossible, in the absence of 
20 other structural evidence, to distinguish direct effects 
from those mediated via template interactions. Moreover, 
the side chains listed above encompass an area much 
larger than the dNTP molecule and therefore cannot all be 
in direct contact with it. Since the region of Klenow 
25 fragment implicated by these studies is thought to make 
extensive contact with the template strand, a reasonable 
interpretation is that a subset of the residues mentioned 
above are in direct contact with the dNTP, while the 
remainder bind the template DNA," 
30 Pelletier et al . , 264 Science 1891, and Sawaya et 

al . , 264 Science 1930, 1994 (not admitted to be prior 
art to the pending claims) in contrast, indicate that 
residues 271 - 274 in helices M-N of PolS (which are 
analogous to helices J-K of Klenow) "perform a common 
35 function, nucleotide discrimination . " 

Sousa et a J . 364 Nature 593, 1993 (not admitted to 
be prior art to the pending claims) describe the three 
dimensional structure of T7 RNA polymerase and its 
homology to E. coli DNA polymerase I. They state that 
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their observations suggest the "C-tenninal elements of KF 
[Klenow fragment] (b-strand 14 [residues 916 to 928] and 
the C terminal) contact the deoxyribose moiety of the 
dNTP during polymerization to discriminate between rNTP 
5 and dNTP substrates*" 

Dideoxynucleosides, such as dideoxythymidine, are 
potent inhibitors of T7 phage growth. Experiments 
indicate that the inhibition of DNA synthesis is a 
consequence of the incorporation of the dideoxynucleotide 

10 into T7 DNA. Dideoxynucleosides are not inhibitory to 
uninfected E* coli. We do not know the explanation for 
the lack of inhibition of E. coli DNA synthesis but it 
could be explained by cellular uptake, a high level of 
discrimination against their incorporation by E. coli DNA 

15 polymerase III/ inefficient phosphorylation to a 
triphosphate, or efficient removal. In any case we find 
that T7 mutant phage arise that can yield normal plaques 
on agar plates containing dideoxynucleosides with a 
frequency of approximately 10'^. The location of many of 

20 these mutations are shown in Fig. 1. They reside within 
the gene 5 protein. The mutant gene 5 proteins are more 
discriminatory (by a few fold) against ddNTPs than the 
native gene 5 protein. Some members of this class of 
mutants may delineate the region of the polymerase that 

25 is important in recognition of the ribose moiety of the 
dNTP. 

It is important to note that the mutations obtained 
by this selection using dideoxynucleosides are based on 
an alteration of the region of the gene 5 protein that 

30 recognizes the ribose moiety. However, it is possible 
that such mutations will also have dramatic effects on 
other nucleotide analogs. In addition, it is possible to 
use the same procedure to select for other T7 mutants 
that discriminate strongly against other nucleotide 

35 analogs on the basis of the growth of phage in the 
presence of the other analogs. 

Referring to Table I, various DR mutants are 
indicated with the amino acid substitution noted in the 
table. The amino acid substitution is further 
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characterized on the right hand side of the table. The 
location of these mutants is shown in FIG. 1 throughout 
the palm and finger regions of T7 DNA polymerase. The 
thumb region is a flexible region that interacts with 
5 minor groove of product duplex DNA; the palm region is 
the catalytic center, the binding site for the 3 ' end of 
the primer and contributes to dNTP binding; the finger 
regions interact with ss template close to the site. of 
synthesis and contribute to dNTP binding. These mutants 

10 are highly scattered throughout the polymerase and all 
have a relatively minor effect on incorporation of 
dideoxynucleotides, decreasing the ability to incorporate 
a dideoxynucleotide by only 5-20 fold. In addition, some 
of these mutants are located in regions of comparative 

15 non-homology to DNA polymerase I . Thus, they do not 
provide an indication of the location of sites in other 
Pol I enzymes involved in ddNTP discrimination. 

Tat?le 1 

20 

Summary of T7 DNA Polymerase Dideoxy-Resistant Mutants 



tlutant Hg. iaolatea wodifimunn 



DRl 


1 


Ala 425 


Thr 


Hydrophobic Polar 


DR2 


1 


Phe 434 


Ser 


Hydrophobic Polar 






GXy 442 


Ser 


Hydrophobic Polar 


DR3 


1 


Val 443 


lie 


Hydrophobic Hydrophobic 


DR4 


2 


Arg 444 


His 


Strong basic Weak basic 


DR5 


1 


Arg 444 


Cys 


Strong basic Neutral, polar 


DR6 


e 


Ser 4 7-7 


Phe 


Polar Hydrophobic 


DR7 


4 


Asp 504 


Asn 


Basic Neutral 


DR8 


2 


Ala 513 


Thr 


Hydrophobic Polar 


DR9 


2 


Thr 517 


lie 


Polar Hydrophobic 


ORIO 


1 


Ala 532 


Ser 


Hydrophobic Polar 


DRl I 


1 


Arg S€S 


Cys 


Strong basic Neutral, polar 


DRl 2 


1 


Ala 619 


Thr 


Hydrophobic Polar 


DR13 


3 


Ala 700 


Thr 


Hydrophobic Polar 
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SunriTTia-rY 

7 Hydrophobic Polar 

3 Strong basic Neutral, polar or weak basic 

2 Polar Hydrophobic 

1 Hydrophobic Hydrophobic 

In vitro M»taq.>n«>j^i 

In vitro mutagenesis of the cloned gene 5 of T7 was 
used to construct gene 5 proteins in which different 
regions of B. coli DNA polymerase I were substituted for 
the analogous or homologous regions in T7 gene 5 protein. 
As discussed. we were particularly interested in 
determining the ability of these enzymes to incorporate 
nucleotide analogs and the extent to which they 
discriminate against these analogs . Referring to FIG . 2 . 
the regions within which hybrids between T7 DNA 
polymerase and E. coli DNA polymerase I were made are 
shown as marked as regions A-E. 

Referring to FIG. 3, Applicant has determined that 
region C provides a riboselectivity region having a 
significantly greater effect than other regions in the 
polymerase. Some of these other regions are specifically 
indicated in FIG. 3. 

Referring to FIGS. 4-6 (and Table 2) , it was 
determined that substitution of amino acids in this 
region allowed conversion of the riboselectivity of a 
polymerase from a E. coli DNA pol I -type to a T7 DNA 
polymerase type and vice versa. Thus, by targeted 
mutagenesis of this region of poll -type polymerases the 
riboselectivity of a polymerase can be significantly 
altered. The level of effect is at least 50-100 fold, 
and generally more than 500 -fold. 

DNA Polvmerasps 

DNA polymerases useful in this invention include 
those belonging to the class of homologous polymerases 
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termed "Pol I -type DNA polymerases" including T7-type DNA 
polymerases, the large fragment of E. coli DNA polymerase 
I, and Taq polymerase, 

DNA polymerase useful in this invention include 
those belonging to a class of homologous polymerases 
including T7 type DNA polymerases (such as T7, T3 , 01, 
011, H W31, gh-1, Y, A1122, or SP6) . By homologous 
polymerases is meant an enzyme such as those described by 
Delarue et al . , Protein Engineering 3, 461-467 (1990), 
that presents an alignment of the Pol I family of DNA 
polymerases. It also presents the alignment of 

Table 2. 

Effect of domain exchanges between E. coli DNA polymerase 
I, T7 DNA polymerase, and Taq DNA polymerase within helix 
0 on discrimination against ddNTPs . The sequence of the 
three polymerases are shown at the top, with the number 
of the first residue indicated. Below the consensus 
sequence for these three polymerases the mutants 
characterized in T7 DNA polymerase (T7) , E. coli DNA 
polymerase I (Pol) and Taq DNA polymerase (Taq) are 
shown, with the mutated residues underlined. Each mutant 
was tested for its relative rate of incorporation of 
ddNMP to dNMP by SDS activity gel analysis as described 
in Example 2 and indicated on the right . Mutants T7 C- 
T8, Pol I C-K6, and Taq C-Q5 were purified along with the 
wild- type proteins for further analysis. 

ddNTP 

Enzyme Sequence Discrimination 



Pol I 754 RRSAKAINFGLIYG 

SUBSTITUTE SHEET (RULE 26) 



High 



wo 96/12042 



PCTAJS9S/12928 



31 

Taq 658 RRAAKTINFGVLYG High 

T7 517 RDNAKTF lYGFLYG Low 

Consensus R A K G Y G 

T7WT RDNAKTFIYGFLYG Low 

f7 C-T2 R K_S A K A I N F G L_I Y G High 

T7 C-T3 R R S AKTFIYGFLYG Low 

T7 C-T4 R D N A K A T W F G F L Y G High 

T7 C-T5 R D N A K A I I Y G F L Y G Low 

T7 C-T6 RDNAKTF N F G F L Y G High 

T7 C-T7 RDNAKTFHYGFLYG Low 

T7 C-T8 RDNAKTF I £ G F L Y G High 

Pol IWT RRSAKAINFGLIYG High 

Pol I C-Kl R ILJS A K T F T V G £_L Y G Low 

Pol I C-K2 R R S A K T F T Y G L I Y G Low 

Pol I C-K3 R R S A K T F N F G L I Y G High 

Pol I C-K4 R R S A K A I I Y G L I Y G Low 

Pol I C-K5 RRSAKAIXFGLIYG High 

Pol I C-K6 R R S A K A I N X G L I Y G Low 

Taq WT RRAAKTINFGVLYG High 

Taq C-Ql R D N AKTINFGVLYG High 

Taq C-Q2 R R A A K T F T Y G £ L Y G Low 

Taq C-Q3 R R A A K T I T Y G V L Y G Low 

Taq C-Q4 RRAAKTIIFGVLYG High 

Tqa C-Q5 RRAAKTINFGVLYG Low 

Specificity residue 

conserved sequence motifs from six families of 

polymerases: DNA polymerases from Pol I, Pol alpha, and 
Pol beta families, DNA-dependent RNA polymerases, reverse 
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transcriptases/ and RNA- dependent RNA polymerases;- their 
results suggest a few residues are conserved between all 
polymerases. According to their alignment (Figure 3, 
page 463) , the selectivity residue identified herein 
5 (phenylalanine 762 in E. coli DNA polymerase I) is in 
"Motif B". In addition to the pol I family of DNA 
polymerases. Motif B is found in the pol alpha family of 
DNA polymerases and the T7 family of DNA-dependent RNA 
polymerases; thus this alignment strongly suggests the 
10 residues one should mutate in these other families of 
polymerases, that include T4 DNA polymerase. Herpes DNA 
polymerase, 029 DNA polymerase. Vent DNA polymerase and 
Pfu DNA polymerase. 

In addition, Joyce, Current Opinion in Structural 
15 Biology 1, 123-129 (1991) compares the DNA sequences from 
many polymerases, and suggests that a small number of 
important active-site residues are conserved. In 
particular, there is a discussion of the similarities 
between polymerases of the pol I family {T7, pol I, Tag) 
20 and of the pol alpha family (T4, 029, Herpes). In Figure 
1 (page 124) Joyce indicates the positions of 5 invariant 
residues found in these two families; they include lysine 
758, tyrosine 766, and glycine 767; these are all very 
close to the selectivity residue identified herein, 
25 phenylalanine 762 . 

These polymerases are used in a DNA sequencing 
reaction under conditions in which they preferably 
produce nearby bands of approximately uniform intensity 
(with about a 1.5- to 2.0-fold variation in intensity) 
3 0 when the DNA products of the sequencing reaction are run 
in a gel , By nearby is meant to include bands 
representing DNA products of molecular weight differing 
by as much as 6000, i.e., 2 0 bases. The actual value of 
this intensity will decrease along the length of the gel, 
35 as described in Tabor and Richardson, supra. Band 
intensity reflects the number of DNA products within a 
certain band. Labels such as fluorophores or 

radioisotopes, are used to produce a readily detectable 
band of intensity reflective of the number of such DNA 
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products. Thus, in this invention, nearby bands derived 
from one sequencing reaction with one chain terminating 
agent have approximately the same number of DNA products 
and thus a uniform band intensity. The sequencing 
5 conditions include incubation of the polymerase in the 
presence of specific divalent or trivalent cations such 
as manganese {II and III), ferrous and ferric ions; 
monovalent and divalent cations which have no detectable 
effect, or are detrimental to DNA synthesis, include: K, 
Na, Ba, Be, Ca, Cc, Cr, Co, Cu, Ni, Si and Zn. The anion 
is unimportant, for example, chloride, acetate, and 
sulfate are suitable. Under these conditions the 
requirement for chain terminating agents, such as 
dideoxynucleosides, may be lessened by several -fold for 
15 enzymes for this invention. A chelator may also be 
provided in this solution in order to help regulate the 
concentration of available divalent metal ions. For 
example, citrate or isocitrate may be provided. These 
chelates are thought to maintain the level of, for 
20 example, free manganese ions at a concentration of 
between 10 and 100 uM over a wide range of manganese 
concentrations. That is, the chelator acts as a buffer. 

The DNA polymerases of this invention do not 
discriminate significantly between dideoxynucleotide 
25 analogs and deoxynucleotides along the length of the DNA 
template. That is, these polymerases are unable to 
discriminate significantly becween a nucleotide that has 
a 3' hydroxyl group versus one that does not (i.e., has 
two hydrogens at the 3' position of the ribose) . 
3 0 However, these polymerases may discriminate against 
modifications at other positions on the nucleosides, even 
in the presence of manganese or iron. For example, the 
polymerases may discriminate against some 
dideoxynucleotide analogs which have fluorescent groups 
35 attached compared to deoxynucleotides. However, the 
polymerases do not discriminate to a different extent at 
neighboring, or nearby nucleotides, on the basis of the 
presence or absence of the modification to the 
dideoxynucleotide. Thus, while they discriminate 
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Strongly against these analogs, requiring higher 
concentrations for a DNA sequencing reaction compared to 
unmodified dideoxynucleosides , the intensity of nearby 
bands will still be uniform. 
5 Thus, the polymerases of this invention provide a 

uniform efficiency of incorporation of chain terminating 
agents, even if they discriminate against overall 
incorporation. In addition, other polymerases of this 
invention will give more uniform bands with fluorescent 
10 ddNTPs than the corresponding naturally occuring enzyme, 
although not as uniform as with unlabelled or 
radioactively labelled ddNTPs. 

Chain terminating agents useful in this invention 
include dideoxynucleosides having a 2', 3*dideoxy 
15 structure. Other agents useful in the invention are 
those able to specifically terminate a DNA sequencing 
reaction at a specific base, and are not discriminated 
against by the polymerase under the above conditions. 

In order . to determine whether any particular DNA 
20 polymerase, in combination with any particular chain 
terminating agent, or other component of a sequencing 
reaction mixture, is useful in this invention, a standard 
sequencing reaction is performed, as described in Tabor 
and Richardson, supra, and the extent of band formation, 
25 and the uniformity of nearby bands in a sequencing gel, 
reviewed. If the polymerase reaction does not extend the 
primer by at least 20 bases, it is not suitable under the 
conditions used. Adjacent band uniformity within a two- 
fold or less range is useful in this invention, 
30 preferably the uniformity is within a 1.0-1.5 fold range. 
Similarly, determination of optimum cation concentration, 
or of other potential cations useful in the invention, is 
determined by use of this sequencing reaction under 
various conditions. For example, cations are tested in 
35 ranges from 0.005-100 mM. An example of such an 
experiment follows: 

The ability to incorporate a given ddNMP compared to 
the corresponding dNMP for any one enzyme is measured as 
the ratio of ddNTP to dNTP necessary to allow DNA 
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synthesis that terminates in a fixed range, detected as 
producing bands of no greater than a fixed molecular 
weight. That is, the bands produced in the reaction end 
within a specified range in the sequencing gel. Thus, if 
5 one enzyme discriminates 1000 -fold greater against a 
given ddNTP compared to another enzyme, a 1000-fold 
higher ratio of ddNTP to dNTP will be necessary to obtain 
bands terminating at the corresponding sites in the same 
range of the gel • 

10 

Exonuclease Agt:iv^^y 

The DNA polymerases of the invention preferably have 
less than 50%, preferably less than 1%, and most 
preferably less than 0.1%, of the normal or naturally 
15 associated level of exonuclease activity (amount of 
activity per polymerase molecule) . By normal or 
naturally associated level is meant the exonuclease 
activity of e.g., an unmodified T7-type polymerase. 
Normally the associated activity is about 5,000 units of 
20 exonuclease activity per mg of polymerase, measured as 
described below by a modification of the procedure of 
Chase et al . (249 J. Biol, Chem. 4545, 1974). 
Exonucleases increase the fidelity of DNA synthesis by 
excising any newly synthesized bases which are 
25 incorrectly basepaired to the template. Such associated 
exonuclease activities can be detrimental to the quality 
of DNA sequencing reactions. They raise the minimal 
required concentration of nucleotide precursors which 
must be added to the reaction since, when the nucleotide 
30 concentration falls, the polymerase activity slows to a 
rate comparable with the exonuclease activity, resulting 
in no net DNA synthesis, or even degradation of the 
synthesized DNA. 

More importantly, associated exonuclease activity 
3 5 may cause a DNA polymerase to idle at regions in the 
template with secondary structure impediments. When a 
polymerase approaches such a structure its rate of 
synthesis decreases as it attempts to pass. An 
associated exonuclease will excise the newly synthesized 
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DNA when the polymerase stalls. As a consequence 
numerous cycles of synthesis and excision will occur 
Th^s may result in the polymerase eventually synthesizing 
past the hairpin (with no detriment to the quality of the 
5 sequencing reaction) ; or the polymerase may dissociate 
from the synthesized strand (resulting in an artifactual 
band at the same position in all four sequencing 
reactions); or, a chain terminating agent may be 
incorporated at a high frequency and produce a wide 
variability in the intensity of different fragments in a 
sequencing gel. This happens because the frequency of 
xncoxporation of a chain terminating agent at any given 
sxte increases with the number of opportunities the 
polymerase has to incorporate the chain terminating 
15 nucleotide. 

An ideal sequencing reaction will produce fragments 
that gxve bands of uniform intensity throughout the gel 
Thxs xs essential for obtaining the optimal exposure of 
the X-ray film for every radioactive fragment. If there 
xs variable intensity of radioactive bands, then fainter 
bands may go undetected. To obtain uniform radioactive 
intensity of all fragments, the DNA polymerase should 
spend the same interval of time at each position on the 
DNA, showing no preference for either the addition or 
removal of nucleotides at any given site. This occurs if 
the DNA polymerase lacks any associated exonuclease, so 
that xt will have only one opportunity to incorporate a 
chaxn terminating nucleotide at each position along the 
template. 



20 



25 



30 



The DNA polymerase of the invention is preferably 
able to utilize primers of lo bases or less, as well as 
longer ones, most preferably of 4-20 bases e.g., 6 bases 
35 (which can be used in groups of three to form an 
equivalent of an i8-mer) . The ability to utilize short 
prxmers offers a number of important advantages to DNA 
sequencing. The shorter primers are less expensive and 
easxer to synthesize than the usual i7-mer Drimers. They 
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also anneal faster to complementary sites on a DNA 
template, thus making the sequencing reaction faster. 
Further, the ability to utilize small (e.g., six or seven 
base) oligonucleotide primers for DNA sequencing permits 
5 strategies not otherwise possible for sequencing long DNA 
fragments. For example, a kit containing 80-4000 random 
hexamers could be generated, none of which are 
complementary to any sites in the cloning vector. 
Statistically, one of the 80 hexamer sequences will occur 
an average of every 50 bases along the DNA fragment to be 
sequenced. The determination of a sequence of 3000 bases 
would require only five sequencing cycles. First, a 
"universal" primer (e.g.. New England Biolabs #1211, 
sequence 5- GTAAAACGAACGGCCAGT 3') would be used to 
sequence about 600 bases at one end of the insert. Using 
the results from this sequencing reaction, a new primer 
would be picked from the kit homologous to a region near 
the end of the determined sequence. In the second cycle, 
the sequence of the next 600 bases would be determined 
using this primer. Repetition of this process five times 
would determine the complete sequence of the 3000 bases, 
without necessitating any subcloning, and without the 
chemical synthesis of any new oligonucleotide primers. 
The use of such short primers may be enhanced by 
25 including gene 2 . 5 and gene 4 protein of T7 in the 
sequencing reaction i 

In vitro Miirarr^n>>c^-j ^ 

Mutagenesis of the polymerase genes was carried out 
3 0 using standard PGR techniques (see below). 

Discrimination aaaing^ ddNTPg 

In the presence of magnesium as the only divalent 
cation. T7 DNA polymerase discriminates about 3 - 4 fold 
against ddNTPs, less than any other known polymerase. 
The next closest is reverse transcriptase, that 
discriminates about 10 - 50 -fold against ddNTPs (3 - lo 
times more than T7 DNA polymerase) . After these two, all 
other known DNA polymerases characterized in the 
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literature discriminate at least 100-fold against ddNTPs. 
and often 10,000 fold or more. 

In- the presence of manganese discrimination by T7 
DNA polymerase and E. coli DNA polymerase I are reduced; 
5 with T7 DNA polymerase, it is reduced from 3.7 to 1, and 
with B. coli DNA polymerase I it is reduced from 550 to 
3.9 (for ddATP) . Applicant is the first to provide a DNA 
polymerase that in the presence of magnesium ions as the 
only divalent cations has a processivity of less than 100 
10 (defined as the average length of extension from a given 
primer before dissociating from the primer- templat e ; 
Reverse transcriptase has a processivity by this 
definition of about 150 - 200, and T7 DNA polymerase has 
a processivity greater than this) and that discriminates 
15 less than 100 fold against incorporation of a ddNMP. In 
contrast , most of the known DNA polymerases , such a Taq 
that have a processivity less than 100 discriminate more 
than 100 fold against incorporation of a ddNMP. 

Previously it was believed that whereas polymerases 
20 with high processivity such as T7 DNA polymerase remain 
bound to a primer -template for up to several minutes, 
polymerases with low processivity such as E. coll DNA 
polymerase I cycle from one primer template to another 
every few seconds (or over one hundred times more 
25 frequently). See for example Tabor, et al . J. BioJ . 
Chem. 262, 16212-16223 (1987)). While processivity is 
advantageous for DNA sequencing, as in reducing the 
background due to terminations not from dideoxy 
incorporation, the slow cycling time is a disadvantage. 
iO For example, if the polymerase does dissociate at 
specific sequences, it will result in strong art if actual 
bands on a sequencing gel unless there is a large excess 
of polymerase present. On the other hand, with a 
polymerase that cycles rapidly, one can use much less 
5 polymerase since a single enzyme molecule will extend 
many different primers during the course of a sequencing 
reaction. and any given primer end will have the 
opportunity to be extended by many different polymerase 
molecules, decreasing the chance of strong specific stops 
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from occurring. 

However, it is better that a polymerase that cycles 
rapidly also incorporates ddUMPs efficiently, in order to 
give bands of uniform intensity and allow one to use less 
5 ddNTPs. It is also preferred that such a polymerase have 
low or no exonuclease activity, and that one add 
pyrophosphatase to prevent degradation of bands by 
pyrophosphorolysis . 

It is also preferred that one be able to carry out 
10 DNA sequencing reactions with magnesium as the only 
divalent cation (i.e., the absence of manganese) . First, 
polymerases tend to be less active with manganese 
compared with magnesium (see for example Tabor and 
Richardson. Proc. Natl. Acad. Sci . USA 86, 4076-4080 
15 (1989)). Second, while polymerases tend to be active 
over wide ranges of magnesium concentrations, there is a 
very sharp, low optimum manganese concentration required 
in most cases for optimum activity (id.) . And at the 
optimum manganese concentrations there is much less 
20 effect on reduction of discrimination against ddNTPs as 
at much higher concentrations, where the polymerase is 
much less active. Third, manganese is not as convenient 
a metal ion to have in a kit; it readily forms 
precipitates, particularly at higher pH's. Fourth, it is 
25 not clear whether manganese will be effective as a metal 
ion for reducing discrimination against ddNTPs with any 
thermophilic polymerase (i.e., at higher temperatures) . 

Prior to the present invention, we stated (Tabor and 
Richardson Proc. Natl. Acad. Sci. USA 86, 4076-4080 
30 (1989)) that there was a correlation between 
discrimination and processivity : 

"DNA polymerase I has a low processivity, 
dissociating after the incoi^joration of less than 10 
nucleotides. There is a strong correlation between 
^5 the frequency at which the enzyme dissociates from 

a site during DNA synthesis in the absence of 
ddNTPs, and the extent of discrimination against the 
incorporation of a ddNMP at that site (unpublished 
results) . This suggests that DNA polymerase I 
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incorporates dNMPs and ddNMPs at similar rates 
during processive synthesis; however, when synthesis 
is nonprocessive, dNMPs are incorporated 
preferentially over ddNMPs. This model could 
5 account for the greater variabiliy . in ddNMP 

incoporation byDNA polymerase I complared to T7 DNA 
polymerase, since the latter has a processivity two 
orders of magnitude greater, than the former." 
[Citations omitted.] 
Thus the results of this invention with E. coll DNA 
polymerase I and Tag DNA polymerase are surprising, since 
we find no evidence that the mutants described herein do 
increase the processivity of the mutant enzymes. 

15 Thermonhil ir- p plvmf»-ra>:^f.ff 

Thermophilic polymerases that discriminate against 
a ddNTP by less than a factor of 100 are particularly 
useful in this invention. m, addition, those that 
discriminate against a ddNTP by less than a factor of lOO 
in the presence of magnesium as the only divalent cation 
and preferably cycle from one primer- template to another 
more than once per second are useful. Thermophilic 
polymerases are defined as polymerases that have optimum 
DNA polymerase activity in a 15 min. reaction at a 
25 temperature above 60*>C. 

Uniform banrl -i n^ena-i H 

Even though manganese reduces the discrimination of 
Klenow fragment against ddATP from 550 to 3.9 fold Tabor 
30 and Richardson {Proc. Natl. Acad. 5ci. USA 86, 4076-4080 
(1989)) show that there is still wide variability in the 
intensity of individual bands (see Figure 2 id.). Thus, 
apart from T7 DNA polymerase, this invention is the first 
to provide polymerases that cycle rapidly such as Klenow 
fragment and those that are derived from thermophilic 
organisms to produce bands that have uniform intensities, 
even in the presence of magnesium as the only divalent 
cation, .conditions that are preferable for the activity 
of most polymerases (see above). Enzymes which cycle 
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rapidly can be determined by methods known in the art as 
described below. 

Specific Polvnif*r-aof.,c, 

5 From the above information it is possible to readily 

make the following polymerases which will have the 
desired properties discussed above. Each of these 
polymerases can be used for sequencing procedures if the 
level of exonuclease is low enough and the activity of 
the polymerase is sufficient (both of which are well 
known in the art) . See Braithwaite and I to, supra for 
reference to each amino acid site. 

Pol T family 



^- ^^^"^ polymerase I with altered Phe762 

(altered means replaced with e.g., Tyr, or an equivalent 
amino acid to give the non -discriminatory property) . 

Streptococcus pneumoniae DNA polymerase I with 
altered Phe7ll. 

20 Thermus aquaticus DNA polymerase I with altered 

Phe667. 

Thermus flavus DNA polymerase I with altered 
Phe666. 

Bacteriophage T5 DNA polymerase with altered Phe570. 
25 Bacteriophage Spo 1 DNA polymerase with altered 

Leu526. 

Bacteriophage Spo 2 DNA polymerase with altered 
Phe690. 

Mitochondrial DNA polymerase with natural Tyr753 or 
30 altered at this site without reducing the non- 
discriminatory activity. Such a polymerase has not 
previously been used for DNA sequencing. Applicant 
believes that it will be useful in such a procedure 
because of its expected low level of ddNTP 
35 discrimination. If needed, it can be modified to reduce 
exonuclease activity associated with the polymerase 
activity. 
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Polymerase aloha family (also galled po l ymerase IT 

family) 

Delarue et al . , Protein Engineering 3, 461-467 
(1990) show that the two families of polymerases 
5 (polymerase I family and polymerase alpha family) share 
three common motifs. The region they call "Motif B" 
contains the residue we have identified as responsible 
for specificity for the dideoxyribose moiety. This 
region is characterized by the sequence K Ni Nj N3 N5 
10 N7 Y G in the polymerase I family, whejce N is the 
specificity residue: if N4 is a phenylalanine there is 
high discrimination, if N4 is tyrosine there is low 
discrimination- In the polymerase alpha family, the 
sequence is K ij y G (there is one less 

15 base between the conserved residues) . We predict 
therefore that just as with polymerase I type enzymes/ 
changes to the residue (s) in this motif (betwen the 
lysine (K) and the tyrosine (Y) ) will reduce the 
discrimination of these polymerases to ddNTPs . These 
20 residues are as follows: 

Escherichia coli DNA polymerase II 
PRDl DNA polymerase 
0 29 DNA polymerase 
25 M2 DNA polymerase 
T4 DNA polymerase 

Thermuococcus litaralis DNA polymerase 
(Vent) 

Pyrococcus furiosus DNA polymerase 
3 0 Sulfolobus solfataricus DNA polymerase 
Human DNA polymerase alpha 

cerevisiae DNA polymerase I (alpha) 
S. powbe DNA polymerase I (alpha) 
Drosophila melanogaster DNA polymerase 
3 5 alpha 

Trypanosoma . brueei DNA polymerase 
alpha 

Human DNA polymerase del t a 
Bovine DNA polymerase delta 
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Ile494-Phe499 
Iieu341-Ser346 
Leu384-Leu389 
Leu381-Leu386 
Ile558-Leu563 

Leu492-Tyr497 
Leu489-Phe494 
Val604-Thr609 
Leu951-His956 
Lieu945-His950 
Iieu931-His936 

Leu960-His365 

Leu845-His850 
Val695-Val700 
Val694-Val699 
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S. cerevisiae DNA polymerase III 
(delta) 

S. pombe DNA polymerase III 
(delta) 

5 Plasmodium falciparum DNA polymerase 
delta 

S. cerevisiae DNA polymerase II 

(epsilon) 
S. cerevisiae DNA polymerase Rev3 
10 Herpes Simplex virus type . 1 DNA 
polymerase 

Equine herpes virus type 1 DNA polymerase 
Varicella -Zoster virus DNA polymerase 
Epstein-Barr virus DNa polymerase 
15 Herpesvirus saimiri DNA polymerase 

Human cytomegalovirus DNA polymerase 
Murine cytomegalovirus DNa polymerase 
Human herpes virus type 6 DNA polymerase 
Channel Catfish virus DNA polymerase 

2 0 Chlorella virus DNA polymerase 

Fowlpox virus DNA polymerase 
Vaccinia virus DNA polymerase 
Choristoneura biennis DNA polymerase 
Autographa calif arnica nuclear 
25 polyhedrosis virus (AcMNPV) 

DNA polymerase 
Lymantria dispar nuclear polyhedrosis 

virus DNA polymerase 
Adenovirus -2 DNA polymerase 

3 0 Adenovirus -7 DNA polymerase 

Adenovirus -12 DNA polymerase 
S-1 maize DNA polymerase 
kalilo neurospora intermedia DNA 
polymerase 

3 5 pAI2 Ascobolus immersus DNA polymerase 
pCLKl Claviceps purpurea DNA polymerase 
Maranhar neurospora crassa DNA polymerase 
pEM Agaricus bitorquis DNA polymerase 



Ile702-Val707 

Val681-Val686 

Ile€92-Val697 

Val82-Phe830 
Leul087-Thrl0 92 



Val812-Val817 

Val813-Val818 

Val776-Val781 

Cys682-Val687 

Val671-Val676 

Val811-Phe816 

Val717-Phe722 

Ile667-Val672 

Ile750-His755 

Ile586-Val591 

Ile648-Val653 

Ile637-Val642 

Ile669-Leu674 



Arg606-Ile611 

Arg624-Ile629 
Leu696-Iieu701 
Leu762-ljeu767 
I>eu694-Leu699 
Ijeu618-Lieu623 

Lieu776 -Lieu777 
Leu951-Leu956 
Leu831-Leu836 
Ijeu752-Lieu757 
Leu573-Leu578 
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pGKLl Kluyveromyces lactis DNA polymerase Ile785-Leu790 
pGKL2 Kluyveromyces lactis DNA polymerase Ile770-Gly776 
pSKL Saccaromyces kluyveri DNA polymerase Ile775-Gly781 

5 Examples 

The following are examples of methods for determining 
the processivity and cycle times for various polymerases. 
Also provided are examples for determining the level of 
discrimination by a polymerase, and other methods useful 
10 in this invention. 

Examplft 1 , MUtacrenesis of DNA pnlym^-rase qen^^ and 

overproduction of mutant DNA pm vmeraAf^c, 

standard techniques are used for the cloning and 
15 expression of mutant DNA polymerase genes. The genes for 
the large fragment of E. coli DNA polymerase I (Klenow 
fragment) and the large fragment of Taq DNA polymerase 
(KlenTaq or _Taq DNA polymerase, see Barnes 112 Gene 29, 
1992 or Stoff el fragment, see Lawyer et aJ. 2 PCR Methods 
20 Appl 275, 1993), the starting materials for the 
generation of mutants in E. coli DNA polymerase I and Taq 
DNA polymerase, are expressed under the control of the T7 
RNA polymerase promoter. The gene for the J2B amino acid 
deletion of T7 DNA polymerase (see Tabor and Richardson 
25 264, J. Biol. Chem. 6447, 1989), the starting material 
for the generation of mutants in T7 DNA polymerase, is 
expressed under the control of the lac promoter in a 
strain that produces E. coli thioredoxin, a necessary 
factor for processive DNA synthesis by T7 DNA polymerase 
3 0 (Tabor and Richardson, supra) . The gene for Taq DNA 
polymerase mutant F667y is transferred from the gene that 
produces _Taq DNA polymerase to the gene that produces 
the full length Taq DNA polymerase by standard techniques 
using PCR and restriction digestion followed by ligation. 
3 5 Mutagenesis to construct the mutant DNA polymerases 

is carried out using standard mutagenesis techniques by 
PCR similar to the method described by Sarkar and Sommer 
8 BioTechniques 404, 1990. To construct hybrids in which 
more than four amino acid residues are being exchanged. 
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two hybrid primers are construcced whereby PGR is first 
carried out on the donor DNA, and then that product is 
used for PGR on the recipient DNA, generating the hybrid 
molecule. For the construction of hybrids in which the 
5 exchange of domains is four amino acid residues or less, 
^ single PGR primers are synthesized that contain the 
entire region to be transferred as well as the proper 
flanking sequences of the recipient, and that primer is 
used to construct the hybrid molecule directly. 
10 Overproduction of the mutant DNA polymerases is 

carried out using standard techniques (see for example 
Gurrent Protocols in Molecular Biology, Ausubel et al , 
eds.. Chapter 16, 1994) . Mutant proteins are purified by 
standard procedures including ion exchange 
15 chromatography. For the purification of E, coli DNA 
polymerase I mutants, see for example Joyce and Grindley 
80 Proc. Natl. Acad. Sci . 1830, 1983. For the 
purification of Taq DNA polymerase mutants, see for 
example Engelke et al . 191 Analytical Biochemistry 396, 
20 1990. For the purification of T7 DNA polymerase 
mutants, see for example Tabor and Richardson 264, J. 
Biol. Chem. 6447, 1989. Polymerase specific activities 
of each of the purified mutant proteins are determined 
by standard procedures described in these references. 

25 Examp l e 2. Rapid screen of DNA polymerages for mutants 

that are i mproved in their efficiency of i n corporating a 
d i deQXVnuclePtide relative to a deoxynuclf^otidf* 

Mutant DNA polymerases are screened for their ability 
to incorporate dideoxynucleotides by SDS activity gel 
30 analysis. The procedure is a modification of that 
described by Spanos and Hubscher 91 Methods in Enzymology 
263, 1983 and Karawya et al . 135 Analytical Biochemistry 
318, 1983. Briefly, 10 ml of cells are induced for 4 to 
6 hours and then pelleted. The cell" pellet is 
35 resuspended in 0.3 ml 25 mM TrisHGl, pH 7 . 0 , 5 mM EDTA. 
20 1 of the resuspended cells are mixed with 40 1 of a 
solution of 1% SDS {sodium dodecyl sulfate) , 2% 
mercaptoethanol, 3 0 % glycerol, 0.04% bromphenol blue, 
and 100 mM TrisHCl, pH 6.8. The mixtures are incubated 
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at 37 °C for 5 min, and then 20 1 aliquots are loaded in 
duplicate onto two SDS polyacrylamide gels. The SDS 
polyacrylamide gels consist of a resolving gel that 
contains 8% polyacrylamide/ 0.27% bisacirylamide, 190 mM 
5 TrisHCl, pH 8.8, 0.05% SDS, and 25 g/ml denatured 
salmon sperm DNA, and a stacking gel that consists of 5% 
polyacrylamide, 0.17% bisacirylamide, 150 mM Tris-HCl, pH 
€.8, and 0.1% SDS. The two gels are electrophoresed at 
100 V for 13 hr at a constant temperature of 13 in an 
10 electrophoresis buffer consisting of 190 mM Tris HCl and 
pH 8.8, 0.05% SDS. 

After electrophoresis, the gels are washed over 8 hr 
in 4 changes of 500 ml each of Renaturation Buffer (50 mM 
Tris.HCl, pH 7.5, 5 mM magnesium acetate, 1 mM 
15 dithiothreitol, 40 mM KCl, 400 g/ml bovine serum 
albumin, 16% glycerol and 0.95 mM EDTA) at 4 <*C. 

The renatured proteins are assayed for DNA polymerase 
activity by incubating each of the two gels in 6 ml of 
Renaturation Buffer, 1.5 M 4dNTPs, 4 1 of [a-^^pj^ATP 
20 (800 Ci/mmol, 10 mCi/ml) , and 80 g of purified 
thioredoxin. One of the mixtures also contains 30 M 
ddTTP (a 20 -fold molar excess over dTTP) . The mixtures 
are incubated for 4 hr at 37 °C (70 ^'C for 2 hr for 
thermophilic DNA polymerases) . 
25 After incubation the gels are washed for 8 hr against 

four changes of 5% trichloracetic acid and 1% sodium 
pyrophosphate. The gels are then dried and 

autoradiographed . 

To determine whether a mutant DNA polymerase is 
30 discriminating more or less against ddTTP, the 
intensities of the radioactive bands are compared on the 
two gels that were incubated in the presence and absence 
of ddTTP, and the ratio of the signal in the two bands 
for the unmodified DNA polymerase is compared with the 
3 5 ratio of the signal in the two bands for each of the 
mutants. If a mutation results in a DNA polymerase being 
less discriminatory towards ddTTP, then there will be a 
greater percentage decrease in radioactivity in the band 
in which ddTTP was present for the mutant DNA polymerase 
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compared with the unmodified DNA polymerase. For 
example, under these conditions the radioactive bands 
observed for cells containing induced E. coli DNA 
polymerase I or T7 DNA polymerase mutant Y52 6F are 
5 approximately the same intensity (within a factor of two) 
in reactions carried out in the presence versus the 
absence of ddTTP. In contrast, for cells containing 
induced E. coli DNA polymerase I mutant F762Y or T7 DNA 
polymerase, the bands on the gel in which the reactions 
10 are carried out in the presence of ddTTP are less that 5% 
the intensity of the bands corresponding to the reactions, 
carried out in the absence of ddTTP. 

This assay represents a rapid method of screening a 
large number of DNA polymerase mutcuits for their ability 
15 to discriminate against dideoxynucleotides . It can 
detect changes of at least 5-fold in the relative rate of 
discrimination. However, this assay should be followed 
up with purification of potentially interesting mutant 
DNA polymerases and more rigorous assays of the purified 
2 0 proteins similar to those described below in order to 
determine precisely the effect of each mutation on 
discrimination against dideoxynucleotides. 

The following examples are methods for determining 
the processivity and cycle times for various polymerases, 
25 determining the level of discrimination against ddNTPs by 
a polymerase, determining the uniformity of bands 
generated by dideoxy-terminated fragments on a DNA 
secjuencing gel, and using the DNA polymerases of this 
invention for DNA sequence analysis. 
30 ' 

Example 3, Preparation and purification of a single- 
stranded M13 DNA - 5' "P-labeled 40-mer primer complex 

The template is M13 mGPl-2 single -stranded DNA, 9950 
nucleotides in length. as described in U.S. Patent 
35 4,795,699 (Figure 9). The phage M13 mGPl-2 is deposited 
in ATCC as number 40303 . M13 mGPl-2 single -stranded DNA 
is purified as described in Tabor et al . , 262 J. Biol. 
Chew. 16212, 1987. Briefly, the phage is purified 
through two CsCl gradient centrif ugations , the CsCl is 
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removed by dialysis, the DNA is removed from the phage by 
extraction with phenol and chloroform in the presence of 
0.1% sodium dodecyl sulfate, and the extracted DNA is 
dialyzed extensively against 20 mM Tris HCl, pH 7.5, 2 mM 
5 EDTA and stored at 4 ^C. The concentration of M13 mGPl-2 
single -stranded DNA is determined spectrophotometrically 
using an extinction coefficient of 8.1 Aaso units = 1 
mg/ml, or 0.3 pmoles of M13 mGPl-2 template molecules per 
microliter. 

10 The primer is a synthetic 4 0-mer having the sequence 

5 • d (TTTTCCCAGTGACGACGTTGTAAAACGACGGCCAGTGCCA) 3 • 

synthesized by standard procedures. It is complementary 
to M13 mGPl-2 DNA at nucleotides 9031 to 8992 (see '699 
patent supra for sequence) . The primer is purified by 

15 ion exchange chromatography or denaturing polyacryl amide 
gel electrophoresis prior to end labeling. 

The primer is labeled and annealed to the template 
essentially as described in Tabor et al (supra) , The 
primer is 5* end labeled in a reaction mixture (15 1) 

20 containing 40 mM TrisHCl, pH 7.5, 10 mM MgClz, 5 mM 
dithiothreitol, 100 mM NaCl, 50 g/ml BSA, 50 Ci 
[g^^PlATP, 6000 Ci/mmol, 5 pmoles of primer, and 10 units 
of T4 polynucleotide kinase prepared from the Es^Tl 
mutant, which is deficient in phosphatase activity. The 

25 mixture is incubated, at 37^C for 15, min, followed by 
70**C for 15 min to inactivate the kinase. 60 1 of 
single-stranded M13 mGPl-2 DNA {0.25 mg/ml), 6 1 of IM 
NaCl and 3 1 of 0.2 M MgClj are added, and the mixture 
is slowly cooled from 70 to room temperature (about 2 0 

30 - 25**C) over a period of 30 min. The mixture is then 
extracted once with a 1:1 mixture of phenol and 
chloroform, and after centrif ugation for 3 0 sec in a 
microfuge, the aqueous phase (70 1) is placed on a 1 ml 
column of Sepharose CL-6B equilibrated in 20 mM 

35 Tris HCl, pH 7 . 5 , 2 mM EDTA, and 100 mM NaCl . The 
labeled primer- template complex is eluted from the column 
with the same buffer used for equilibration; the labeled 
complex elutes in the void volume. After elution, the 
complex is at a concentration of approximately 50 g/ml 
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{0.015 pmoles of molecules per 1) with a specific 

activity of approximately 200,000 cpm/ 1. 

Examp l e 4, Determination of the pmr^ ssivity of a dna 

5 polymerase bv dilution rf^^i- 

Processivity is determined by enzyme dilution 
essentially as described in TaJbor et al (supra) and Tabor 
and Richardson, 84 Proc. Natl. Acad. Sex USA 4767, 1987. 
The reactions are carried out under the same conditions 
10 as in the extension/termination reactions used in DNA 
sequencing (Tabor and Richardson, supra), except that 
ddNTPs are omitted and the polymerase concentration is 
reduced in order to have an excess of primer- template 
molecules over polymerase molecules in some of the 
15 reactions. The primer-template consists of the single 5* 
end- labeled primer annealed to a single -stranded M13 DNA 
molecule as described in Example 3. 

The extension reaction mixtures are prepared 
substantially as described in Tabor et al (supra) . Each 
20 reaction mixture (18 1) contains 1.0 1 annealed "P- 
labeled primer-M13 DNA as described in Example 3 (-0.015 
pmoles, -200,000 cpm), 40 mM Tris*HCl, pH 8.0, 5 mM MgClj, 
5 mM dithiothreitol, and 300 M 4dNTPs. The mixtures are 
incubated at 37«C for 1 min (70«C for thermophilic DNA 
25 polymerases) . The reactions are initiated by the 
addition of 2 1 aliquots of dilutions of the DNA 
polymerase being analyzed, diluted in 20 mM Tris HCl, pH 
7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum 
. albumin. The reaction mixtures are further incubated at 
30 37°C (70^C for thermophilic DNA polymerases) for either 
3 0 sec or 3 min. At the indicated times, 8 1 aliquots 
are removed and added to either 8 1 of 90% formamide, 2 0 
mM EDTA, 0.05% bromphenol blue for. denaturing 
polyacrylamide gel electrophoresis, or 2 1 of 100 mM 
35 EDTA, 2 % sodium dodecyl sulfate for alkaline agarose gel 
electrophoresis . 

The samples are analyzed by either denaturing 
polyacrylamide gel electrophoresis or alkaline agarose 
gel electrophoresis.. Denaturing polyacrylamide gel 
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electrophoresis is most suitable for analyzing 
polymerases with an average processivity of less than 500 
nucleotides, while alkaline agarose gel electrophoresis 
provides a more sensitive estimate of the processivity of 
5 DNA polymerases with an average processivity greater than 
500 nucleotides; however, either method can be used 
successfully to determine the average processivity of any 
DNA polymerase . 

To determine the processivity by denaturing 

10 polyacrylamide gel electrophoresis, aliquots in formamide 
are heated at 90°C for 2 min immediately prior to 
loading 6 1 of each sample onto a gel consisting of 8% 
polyacrylamide, 0.4% N, -methylenebisacrylamide, 7 M 
urea in 100 mM Tr is borate, pH 8 , 3 , 1 mM EDTA. 

15 Electrophoresis is at 2000 Volts for 90 min (until the 
bromphenol blue has just run off the bottom of the gel) . 
Suitable 5* ^^P-end labeled molecular weight markers are 
also loaded onto the gel that allow the determination of 
fragment sizes 100 to 500 nucleotides in length. An 

20 example of such suitable markers are T7 Hpal fragments 
that have been dephosphorylated with alkaline phosphatase 
and then 5' ^^p.^^d labeled using PjATP and T4 

polynucleotide kinase using standard procedures (Sambrook 
et al., 1989, Molecular Cloning A Laboratory Manual, Cold 

25 Spring Harbor Laboratory, N.Y. pp. 6.20-6.21 and Ausubel 
et a J . , 1989, Current Protocols in Molecular Biology, 
Greene Publishing Associates and Wiley Interscience, 
N.YJ. After electrophoresis, the gel is dried under 
vacuum, and autoradiographed . After autoradiography, the 

3 0 distribution of radioactively labeled fragments is 
determined by Phosphorimager analysis (Molecular 
Dynamics) . 

Products are analyzed by alkaline agarose gel 
electrophoresis as described by (1) Villani et al . , 256 
35 J. Biol. Chem. 8202, 1981, by (2J Sabatino et al . , 27 
BiochejTiistry 2998, 1988, and by (3) Sambrook, ec al . , 
supra . To prepare the agarose gel, 250 ml of a 1.5% 
agarose solution in 2 mM EDTA, pH 8.0 is heaced in a 
microwave oven co dissolve the agarose, and then allowed 
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to cool to 60<'C. 8.75 ml of 1 N NaOH (final 

concentration 35 mM NaOH) is added, the gel is poured 
into an agarose gel electrophoresis mold. The gel is 
allowed to set for 2. hours before use. The 
5 electrophoresis buffer is 35 mM NaOH and 2 mM EDTA. The 
samples are prepared by taking the 10 1 aliquots 
described above (in 20 mM EDTA, 0.4% sodium dodecyl 
sulfate) and denatured by the addition of 1 1 of 1 N 
NaOH and heating at 60°C for 10 min. 7 1 of 75 % 

10 glycerol, 0.2 % bromcresol green are added to each 
sample, and then the samples are loaded onto the alkaline 
agarose gel. Electrophoresis is carried out at 4°C at a 
constant current of 150 mA for 15 hours (until the 
bromcresol green has migrated about 14 cm) . The 

15 electrophoresis chamber has dimensions of 26 cm (length) 
X 20 cm (width) by 2 cm (height) - After electrophoresis, 
the gel is soaked for 2 hours in 10% trichloracetic acid, 
dried under vacuum, and autoradiographed. After 
autoradiography, the distribution of radioactively 

20 labeled fragments is determined by Phosphorimager 
analysis (Molecular Dynamics) . 

To test the processivity of a given DNA polymerase, 
the concentration of that polymerase in the reaction 
mixture is diluted by 2- fold increments until only a 

25 fraction (e.g. 25%) of the primers are extended, while 
the majority (e.g. 75%) remained unchanged, 40 
nucleotides in length. Under these conditions, a two- 
fold increase or decrease in DNA polymerase concentration 
should result in about a two fold increase or decrease, 

3 0 respectively, in the fraction of primers extended. The 
average length of the labeled fragments that are extended 
is derermined either by visual inspection of the 
autoradiograph or by quantitation using a phosphoimager . 
For example, using this test, the exonuclease-def icient 

3 5 T7 DNA polymerase complexed to its processivity factor 
thioredoxin (e.g. SEQUENASE® Version 2.0, United States 
Biochemical Corporation) has an average processivity of 
more than 500 nucleotides, while Klenow fragment of E, 
coli DNA polymerase I has a processivity of less than 5 0 
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nucleotides. 

Example 5, — Determination of the rate o f cyclina of a DNA 

PQlvmerflse 

5 The rate of cycling is determined essentially as 

described TaJbor et al (supra) . The reactions are carried 
, out under the same conditions as in the 
extension/termination reactions used in DNA sequencing 
{Tabor and Richardson, supra), except that ddNTPs are 

10 omitted and the polymerase concentration is reduced in 
order to have an excess of primer- template molecules over 
polymerase molecules. The primer-template consists of 
the 5' [32p) end-labeled primer annealed to single-stranded 
M13 DNA molecule as described in Example 3. 

15 First a test is carried put to determine the 

functional ratio of primer- template molecules to 
polymerase molecules, for example, using the large 
fragment of E. coli DNA polymerase I (Klenow fragment). 
A dilution experiment is carried out as described in 

20 Example 4 , to determine the concentration of polymerase 
molecules necessary to extend 20% of the labeled primer- 
template molecules in 10 sec. This ratio of polymerase 
to primer- template is defined as being less than or equal 
to 1:5, and is used to determine the maximum rate of 

25 cycling of the polymerase as described below,, 

Extension reactions are carried out as described in 
Example 4 using the ratio of DNA polymerase to primer- 
template less than or equal to 1:5, as defined in the 
previous paragraph. Reactions are carried out under 

30 conditions (i.e. buffer, pH, salt, temperature) optimal 
for the polymerase being tested. Aliquot s are removed at 
10 sec, 20 sec, 40 sec and 80 sec, and the reactions are 
terminated and the products analyzed as described in 
Example 4. For DNA polymerases with low processivity 

35 (less than 100 nucleotides) , such as the large fragment 
of E. coli DNA polymerase I, samples are analyzed 
preferably by denaturing polyacrylamide gel 
electrophoresis. For DNA polymerases with high 

processivity (greater than 100 nucleotides) , such as T7 
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DNA polymerase, samples are analyzed preferably by 
alkaline agarose gel electrophoresis. After 
electrophoresis, the gels are dried under vacuum and 
analyzed either by autoradiography or with a 
5 Phosphoimager . 

When reactions are carried out with a polymerase that 
cycles very slowly, such as T7 DNA polymerase, there is 
not a significant decrease (i.e. less than a factor of 
two) in the number of unext ended primers between 10 sec 
10 and 80 sec. Thus, if the number of unextended labeled 
primers does not decrease by more than two fold between 
the 10 sec and 80 sec time points, then the DNA 
polymerase cycles slower than once per 70 sec. For 
polymerases that cycle rapidly, the number of unextended 
15 primers .will decrease by a significant fraction (i.e. by 
more than 2 fold) between the 10 and 80 sec time points. 
To determine the rate of cycling for these polymerases, 
the following equation is used: 

20 R = Ni X L(t2) /{L(ti) X (t2-ti)} 

where : 

R = minimum rate of cycling in cycles per sec. 

25 

Ni = ratio of primer- template molecules over 
functional DNA polymerase molecules, (N^ = 5 in the 
example above.) 

LCtj) = the maximum processivity of the DNA. polymerase 
30 being investigated, in nucleotides. This is defined as 
the maximum number of nucleotides extended from the 
labeled primers under conditions of limiting DNA 
polymerase (where only 20% of the primers are extended in 
the 10 sec time point), as described in Example 3. 
35 Lttj) = the maximum length of extension (in 

nucleotides) of the labeled primers at time ta, which is 
80 sec in this example. 

ti: the shortest time at which an aliquot is taken, 
or 10 sec in this example. 
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tj: the longest time the reaction is allowed to 
precede before removing an aliquot, or 80 sec in this 
example. 

When this test is carried out for the large fragment 
5 of E. coli DNA polymerase I, a value of more than 0.2 
cycles per sec is obtained. 

Examples 6 and 7 provide tests to determine the 
efficiency of incorporation of dideoxynucleotides for an 
unknown DNA polymerase using a 5' ^^P-labeled 40-mer 
10 primer annealed to a single -stranded M13 DNA template and 
a gel eilectrophoresis -based analysis. Example 6 is best 
for DNA polymerases that efficiently incorporates 
dideoxynucleotides (e.g. wild-type T7 DNA polymerase and 
Taq DNA polymerase. F667Y> while Example 7 is best for DNA 
15 polymerases that discriminate strongly against the 
incorporation of dideoxynucleotides (e.g. T7 DNA 
polymerase Y526F and wild-type Taq DNA polymerase) . 

Example 6. Gel electrophoresis -based determination of 
20 the rate of incorporation of dideoxynucleotides relative 

to deoxynucleotideg using a 1:1 ratio ot dNTPs to ddNTPs. 

The primary application of this test is to determine 
the absolute ratio of incorporation of a ddNMP to a dNMP 
for a DNA polymerase that efficiently incorporates 

25 dideoxynucleotides, such as T7 DNA polymerase, E. coli 
DNA polymerase I mutant F762Y or Taq DNA polymerase 
mutant F667Y. It can also indicate the level of 
discrimination against ddNTPs of any DNA polymerase; 
however, for DNA polymerases that discriminate strongly 

3 0 against ddNTPs, such as T7 DNA polymerase mutant Y52 6F, 
E. coli DNA polymerase I or Taq DNA polymerase, higher 
ratios of ddNTP to dNTP are necessary to determine 
precisely their level of discrimination, which is 
described in detail in Example 7. 

3 5 DNA synthesis reactions are carried out on the ^^P-end 

labeled 40 mer-M13 mGPl-2 DNA template complex prepared 
as described in Example 3 . Reaction conditions are used 
that are optimum for the DNA polymerase being tested with 
regard to buffer, pH, salts, and temperature of the 
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reaction. A concentration of DNA polymerase is chosen 
whereby most of the primers are extended in a 10 min 
reaction and are terminated by the incorporation of a 
dideoxynucleotide . The reaction mixture contains 100 M 
5 4dNTPs and 100 M of one of the four ddNTPs. 

We used this test to compare the ability of six DNA 
polymerases to incorporate each of the four ddNMPs . The 
DNA polymerases tested were (1) T7 DNA polymerase with a 
28 amino acid deletion in the exonuclease domain and 
10 complexed in a one-to-one ratio with thioredoxin (Tabor 
and Richardson 264 J. Biol. Chem. 6447, 1989) (referred 
to here as ^*T7 DNA polymerase"), (2) The large fragment 
of E. coli DNA polymerase I, commonly called the Klenow 
fragment (referred to here as "E, coli DNA polymerase 
15 I") , (3) Unmodified DNA polymerase from Thermus aguaticus 
(referred to here as "Taq DNA polymerase"), (4) T7 DNA 
polymerase as described above in which the tyrosine at 
residue 526 has been changed to a phenylalanine (referred 
to here as **T7 DNA polymerase y526F"), (5) E. coli DNA 
20 polymerase I as described above in which the 
phenylalanine at residue 762 has been changed to a 
tyrosine (referred to here as "E. coli DNA polymerase I 
F762Y"), and (6) Taq DNA polymerase as described above in 
which the phenylalanine at residue 667 has been changed 
2 5 to a tyrosine (referred to here as **Taq DNA polymerase 
F667Y") . 

To test the relative rate of use of each of the four 
ddNTPs compared to the comparable dNTPs for each of the 
DNA polymerases indicated above, the reaction mixtures (8 

30 1) contained 1.0 1 annealed ^^P-labeled primer-M13 DNA 
as described in Example 3 (-0.015 pmoles, -200,000 cpra) , 
40. mM Tris HCl, pH 8 . 0 , 5 mM MgClj, 5 mM dithiothreitol ; 
50 mM NaCl, 100 M 4dNTPs, and 100 M ddCTP . The 
reaction mixtures also contained 10 ng of yeast inorganic 

35 pyrophosphatase to inhibit pyrophosphorolysis that could 
otherwise increase the apparent discrimination by the DNA 
polymerase (Tabor and Richardson 265 j. Biol. Chem. 8322, 
1990) . The reactions were initiated by the addition of 
2 1 of each DNA polymerase, diluted in 20 mM Tris HCl, 
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pH 7.5, 10 mM 2 -mercaptoethanol and 0.05% bovine serum 
albumin to a concencration of approximately 0.025 units/ 
1 . The concentration of each DNA polymerase was 

sufficient to extend most of the labeled primers by more 
5 than 500 nucleotides in the absence of ddNTPs in a 15 min 
reaction. The reaction mixtures were incubated for 15 
min at either 37 «C (T7 DNA polymerase, T7 DNA polymerase 
Y526F, E. coli DNA polymerase I, and E. coli DNA 
polymerase I F762Y) or 70 (Taq DNA polymerase and Taq 
10 DNA polymerase F667Y) . The reactions were terminated by 
the addition of 10 1 90% formamide, 20 mM EDTA, 0.05% 
bromphenol blue. Each sample was heated at 90°C for 2 
min immediately prior to loading 6 1 of each sample onto 
a gel consisting of 8% polyacrylamide, 0.4% N,N*- 
15 methylenebisacrylamide, 7 M urea in 100 mM Tris borate, 
pH 8.3, 1 mM EDTA. Electrophoresis was at 2000 Volts for 
90 min (until the bromphenol blue had just run off the 
bottom of the gel). After electrophoresis, the gel was 
dried under vacuum, and autoradiographed . After 
20 autoradiography, the distribution of radioactively 
labeled fragments was determined by Phosphor imager 
analysis (Molecular Dynamics) . Alternatively, the 
relative intensities of dideoxy- terminated bands can be 
determined by scanning the autoradiograph using an 
25 instrument such as the SciScan 5000 imaging densitometer 
(United States Biochemical Corp) . 

When the set of four reactions (each containing a 
single ddNTP at an equimolar concentration as the dNTP) 
was carried out with each of the six DNA polymerases 
3 0 described above, the reactions with three of the DNA 
polymerases (T7 DNA polymerase Y52 6F, E. coli DNA 
polymerase I and Taq DNA polymerase) resulted in most 
(>50%) of the radioactivity in the primers that had been 
extended migrating at the top of the gel, corresponding 
35 to fragments greater than 300 bases in length. Based on 
the predicted exponential decay of signal with increasing 
fragment size, this corresponds co discrimination by 
these three DNA polymerases of more than 100 fold against 
all four ddNTPs. A more precise measurement of the 
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discrimination against ddNTPs by these three DNA 
polymerases is obtained using the test in Example 7 
below. 

For the other three DNA polymerases {T7 DNA 
5 polymerase, E. coli DNA polymerase I F762Y and Taq DNA 
polymerase F667Y) the autoradiograph showed a series of 
dideoxy- terminated fragments with all of the reactions. 
In general, the average lengths of the labele'd 
synthesized fragments were lowest for Taq DNA polymerase 
10 F667Y, with only about six radioactively labeled dideoxy- 
terminated fragments visible with even a several day 
exposure of the film. The average lengths of labeled 
fragments with E. coli DNA polymerase I F762Y are 
slightly longer than with Taq DNA polymerase F667Y, while 
15 the average lengths are significantly longer with T7 DNA 
polymerase. The fragments are more uniform in intensity 
when synthesized by E. coli DNA polymerase I F762Y and 
Taq DNA polymerase F667Y that by T7 DNA polymerase. 

The distribution of radioactivity in the fragments 
20 was quantitated by Phosphoimager analysis (Molecular 
Dynamics) . The total amount of labeled primers in each 
lane was detennined by ninning three control reactions in 
which no DNA polymerase was present, and the 
radioactivity in each of the corresponding radioactive 
25 bands on the gel at the position of the unextended primer 
was determined. With some preparations of radioactively 
labeled primers, a certain percentage {<10%) is not 
extended by any of the DNA polymerases, regardless of the 
concentration of DNA polymerase used; this background 
3 0 level is determined by measuring the percentage of 
radioactivity remaining at the position of unextended 
primer for a series of four reactions containing ddNTPs, 
and subtracting the average of these four values from the 
cotal number of counts determined previously. This value 
3 5 is defined as the total number of counts in primers that 
are capable of being extended by a DNA polymerase. 

The total number of counts (i.e. radioactivity) in 
the first three dideoxy-terminated fragments were 
determined for T7 DNA polymerase, E. coli DNA polymerase 
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I F762Y and Taq DNA polymerase F6 67Y for each of the four 
ddNTP reactions. The values are presented in the Table 
below as the percentage of counts in the first three 
dideoxy- terminated fragments to the total number of 
5 counts in the primers capable of being extended by a DNA 
polymerase. 

Polymerase Reaction 

ddGTP ddATP ddTTP . ddCTP 

T7 DNA 

10 polymerase 67% 66% 76% 61% 

E. coli DNA 
polymerase 

I F762Y 95% 92% 96% 92% 

15 

Taq DNA 

polymerase F667Y 97% 95% 95% 99% 

As a further test of the efficiency of each DNA 
20 polymerase to incorporate dideoxynucleotides, the number 
of counts in each fragment with a significant signal was 
determined for each reaction, and the data were plotted 
as a function of the fragment number using the Macintosh 
program Kaleidograph Version 3.0 (Synergy Software) . The 
25 resulting plots were fit to an exponential decay curve 
using the Kaleidograph library routine for this function. 
The decay curve is given by the equation: 



30 



Y = e 



where : 



M*X 



Y = 1 - (the fraction of labeled primers in fragments 
1 to X compared to the total number of primers that can 
35 be extended) 

X = the . fragment number (the first dideoxy- 
terminated fragment is 1) 

M = the exponential decay function calculated for the 
data by the Kaleidograph library routine. 
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In the Table below, the following data are provided 
for each of the four ddNTP reactions using T7 DNA 
polymerase, E. coli DNA polymerase I F762Y and Taq DNA 
5 polymerase F667Y: 

N, the number of fragments used to fit each 
exponential curve. 

M, the calculated exponential decay function as 
described above. 
10 D, the discrimination factor given as the ratio of 

the use of a specific dNTP to the use of the comparable 
ddNTP when both nucleotides are present at equal 
concentrations. D is calculated from the equation above 
using the calculated value of M to determine Y when X = 
15 1, and defining D, the ratio of preference of the dNTP to 
the ddNTP, asY/{l-Y). 

R2, the correlation index for the data, was calculated 
by the Kale idograph library routine. This is a measure 
of the variability in band intensities, or the sequence - 
20 specific variability in the ability of a DNA polymerase 
to incorporate the specific dideoxynucleotide . 
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PolymeraseddNTP N M 



T7 DNA ddGTP 8 -0.375 2.2 0.813 

poly- 
merase 

ddATP 6 -0.3 56 2.3 0.996 

ddTTP 5 -0,4 50 1.8 0.997 

ddCTP 8 -0.317 2.7 0.889 

E. coli ddGTP 5 -1.03 0.56 0.999 

DNA poly- 
merase I 
F762Y 

ddATP 5 -0.860 0.72 0.998 

ddTTP 5 -1.06 0.54 1.000 

ddCTP 6 -0.842 0.75 1.000 

Taq DNA ddGTP 5 -1.18 0.45 0.995 

poly- 
merase 
F667Y 

ddATP 6 -0.997 0.59 0.997 

. ddTTP 6 -1.01 0.56 0.996 

ddCTP 4 -1.44 0.32 0.9 96 

Averages : 

T7 DNA 4 ddNTP 2.3 .92 4 

poly^ 

merase 

E. coli 4 ddNTP 0.64 .999 

DNA poly- 
merase I 
F762Y 

Taq DNA 4 ddNTP 0.4 8 .996 

poly- 
merase 
F667Y 



In summary, T7 DNA polymerase discriminaces an 
average of 2.3 fold againsc ddNTPs, while E. coli DNA 
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polymerase I F762Y and Taq DNA polymerase F667Y actually 
prefer ddNTPs over dNTPs an average of 1.6 fold (1/0.64) 
and 2.1 fold (1/0.48), respectively. A comparison of 
indicates that the intensity of neighboring fragments are 
5 more uniform with E. coli DNA polymerase I F762Y and Taq 
DNA polymerase F667Y than with T7 DNA polymerase . For a 
more accurate measure of uniformity, a greater number of 
fragments could be included in the analysis by , reducing 
the level of ddNTPs (for example by 5 fold) in each 

10 reaction, reducing the decay in intensity at each 
position (see Example 13) . 

To deteimine the amount of discrimination against 
ddNTPs by a new DNA polymerase, reactions analogous to 
those described above would be carried but, and identical 

15 reactions would be carried out in parallel using T7 DNA 
polymerase (SEQUENASE Version 2.0, United States 
Biochemical Corporation) , with all reactions analyzed on 
the same gel. An initial comparison of the distribution 
of dideoxy- terminated bands obtained with the new DNA 

20 polymerase compared with those obtained with T7 DNA 
polymerase would indicate whether the new DNA polymerase 
discriminated more or less against ddNTPs than T7 DNA 
polymerase. For example, such a visual inspection using 
E. coli DNA polymerase I F762Y clearly shows that for 

25 reactions with each of the 4 ddNTPs, the number of 
fragments visible on the gel in reactions using E. coli 
DNA polymerase I F762Y are less (and smaller in average 
size; than those using T7 DNA polymerase. A more 
quantitative analysis could then be carried out analogous 

30 to that described above in order to calculate the 
exponential decay factor (M) , average relative rate of 
utilization of dNTPs relative to dNTPs (D) and 
variability in intensity (R^) for the new DNA polymerase 
as described above. 

35 One complication that can occur in this test is when 

the DNA polymerase has an associated exonuclease 
activity, such as the 5' to 3 ' exonuclease activity 
associated with Taq DNA polymerase and the 3' to 5' 
exonuclease activity associated with E. coli DNA 
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polymerase I and native T7 DNA polymerase (not the _2 8 T7 
DNA polymerase deletion mutant used in the experiments 
above), A 5' to 3 ' exonuclease activity is detrimental 
since it can remove the label on the 5' end of the 
5 primer, reducing the radioactivity signal being detected. 
This problem can be partially avoided by reducing the 
amount of DNA polymerase in the reaction mixture. In the 
example above, 0.025 units of Tag DNA polymerase resulted 
in virtually all of the primers being extended until 
10 terminated by incorporation of a dideoxynucleotide , 
without appreciable loss in radioactivity due to 5' to 3' 
exonuclease activity, whereas a 40 fold increase in Taq 
DNA polymerase activity, or 1 unit per reaction, resulted 
in the loss of virtually all "P from the 5' ends of the 

15 primer. An alternative approach to measuring the extent 
of discrimination for a DNA polymerase with a 5' to 3 ' 
exonuclease activity is to use a different assay such as 
those described in Examples 8-10. 

A3' to 5' exonuclease activity can complicate the 

20 assay described above by making the DNA polymerase appear 
to discriminate more against a ddNTP than it actually 
does (for example, see Tabor and Richardson, 86 Proc. 
Nsitl. Acad. Sci. 4076, 1987). This is because once a 
dideoxynucleotide has been incorporated, the exonuclease 

25 activity can preferentially remove the dideoxynucleotide 
so that DNA synthesis can continue, resulting in an 
increase in the length of the fragment. Preferably, the 
enzymes assayed in the test described above are devoid of 
such 3' to 5' exonuclease activity; examples are modified 

3 0 T7 DNA polymerase (Sequenase®, United States Biochemical 
Corporation) , Taq DNA polymerase, exonuclease-def icient 
Vent ( rhennococcus litoralis) DNA polymerase (New England 
Biolabs catalog number 257) , exonuclease -deficient Deep 
Vent (Pyrococcus GB-1) DNA polymerase (New England 

35 Biolabs catalog number 259 ) , exonuclease-def icient Pfu 
(Pyrococcus furiosus) DNA polymerase (Stratagene catalog 
number 600163) , and exonuclease-def icient Klenow fragment 
(E. coli DNA polymerase I, United States Biochemical 
Corporation, catalog number 70057) . In some instances, 
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such as with E. coli DNA polymerase I (Klenow fragment) 
the 3' to 5' exonuclease activity is weak and does not 
interfere significantly with this assay (see for example 
Tabor and Richardson 264 J. Biol* Chem. 6447, 1989). One 
5 method to determine whether a new DNA polymerase being 
tested has a 3' to 5' exonuclease activity that is 
interfering with the ability to accurately measure the 
discrimination against ddNTPs is to carry out the 
experiment described above, removing aliquots at 

10 different time points up to 6 0 min. If the size 
distribution of the dideoxy- terminated fragments 
increases with time, then it is likely that such a 3' to 
5' exonuclease activity is interfering with the assay, 
while if the distribution of fragments is constant over 

15 time then such an activity is not having a significant 
effect. If the average fragment length is increasing 
with time, then one should use a shorter incubation time 
and/or decrease the DNA polymerase concentration to a 
range in which the fragment sizes remain constant with 

20 time. 

Pyrophosphoro lysis, or the reversal of the polymerase , 
reaction, can have a similar effect as the 3' to 5' 
exonuclease activity, allowing the DNA polymerase to 
remove the chain terminating dideoxynucleotide and 

25 further increase the length of the fragments (see Tabor 
and Richardson, 265 J. Biol. Chew, 8322, 1990). This 
activity is readily avoided by including pyrophosphatase 
in . the reaction mixture, in order to remove the 
pyrophosphate that accumulates during DNA synthesis and 

30 is a necessary substrate for pyrophospho ro lysis . 

Example 7. Gel electrophoresis -based determination of 
thfi rate of incorporation of dideoxvnucleot ides relative 

3 5 TIP deoxy nucleotides bv varvinc the ratio of dNTPS tQ 

ddNTPs , 

This example is similar to that described in Example 
6 , While it is the preferred test for DNA polymerases 
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that discriminate strongly against the incorporation of 
dideoxynucleotides (e.g. T7 DNA polymerase Y526F, E. coLi 
DNA polymerase I and Taq DNA polymerase) , it also works 
well with DNA polymerases that efficiently incorporate 
5 ddNMPs (e.g. T7 DNA polymerase, E. coli DNA polymerase 
I mutant F762y and Taq DNA polymerase mutant F667Y) . In 
this test, the ratio of ddNTP to dNTP is varied for two 
different DNA polymerase preparations, keeping all other 
aspects of the reactions identical, and the distributions 
10 of dideoxy- terminated radioactively labeled fragments are 
compared to determine the ratios required for the two DNA 
polymerases being tested to obtain fragments of 
comparable average length. 

The average length of a series of fragments is 
15 determined in one of two ways. In the first, which is 
best for DNA polymerases that incorporate ddNMPs 
efficiently/ one inspects the autoradiograph and 
determines the position of the largest fragments visible 
on a given exposure for a series of reactions containing 
20 ddNTP: dNTP ratios that vary by two-fold increments using 
one DNA polymerase, and compares that to an analogous 
series using the second DNA polymerase, to determine the 
ratios required to generate fragments of comparable size 
for the two DNA polymerases. The position of the front 
25 marking the appearance of visible radioactive bands is 
usually relatively sharp and readily observed by eye . 
However, it is also possible to determine such positions 
more precisely using the Phosphoimager to' locate the 
position in each lane where a certain threshold of 
3 0 radioactivity per unit area occurs, starting at the top 
of the gel and moving down the gel. 

Some DNA polymerases discriminate very strongly 
against the incorporation of dideoxynucleotides, in which 
case it is difficult to add sufficient ddNTPs to the 
35 reaction to clearly detect the position of the largest 
dideoxy- terminated fragments on a denaturing 
polyacrylamide gel. For such DNA polymerases, one can 
use an alkaline agarose gel electrophoresis to compare 
the lengths of the dideoxy- terminated fragments in the 
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different series. If one uses a denaturing 

polyacrylamide gel, then an alternative method to 
determine the ratios of ddNTP:dNTP required for the two 
DNA polymerases to generate dideoxy- terminated fragments 
5 of comparable average lengths is to focus on one or 
several bands and determine the ratio of ddNTPs to dNTPs 
required to obtain a specific level of radioactivity in 
those fragments, as analyzed by the Phospho imager, for 
the two DNA polymerases being tested. 

10 These tests were carried out using the six DNA 

polymerases described in Example 6 . The reaction 
conditions were identical to that described in Example 6 
except for the concentrations of dNTPs and ddNTPs. All 
reaction mixtures contained 10 M 4 dNTPs. Each of the 

15 four ddNTP concentrations were varied by two- fold 
increments in the following ranges for the six DNA 
polymerases as follows: T7 DNA polymerase, E. coli DNA 
polymerase I F762Y, and Taq DNA polymerase F667Y , 0.02 
M to 1 M, and T7 DNA polymerase Y526F, E. coli DNA 

20 polymerase I and Taq DNA polymerase F667Y, 100 to 2,000 
M. The reactions were carried out and the samples were 
analyzed by denaturing polyacrylamide gel electrophoresis 
as described in - Example 6. Drying of the gel, 
autoradiography, and Phosphoimager analysis were as 

25 described in Example 6. The Table below summarizes the 
results from this experiment; the values shown for T7 DNA 
polymerase, E. coli DNA polymerase I F762Y, and Taq DNA 
polymerase F667Y are the absolute ratios obtained in 
Example 6 by statistical analysis of the rate of 

30 exponential decay in intensity of dideoxy- terminated 
fragments obtained using a 1:1 ratio of dNTPs to ddNTPs. 
The values obtained for T7 DNA polymerase Y526F, E. coli 
DNA polymerase I and Taq DNA polymerase were obtained by 
determining the ratios of ddNTP to dNTP required to 

35 generate a series of dideoxy-terminated fragments of 
comparable average length to a series generated using T7 
DNA polymerase, E. coli DNA polymerase I F762Y, and Taq 
DNA polymerase F667Y, respectively; i.e., for each pair 
of wild-type and mutant DNA polymerases the ratios of 
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ddNTPsrdNTPs were determined that give a comparable 
distribution of dideoxy- terminated fragments. The 
ddNTPrdNTP ratio used in the reaction with the strongly 
discriminating enzyme (i.e. the one that contains 
5 phenylalanine at the critical position) divided by the 
ddNTPrdNTP ratio used to obtain a comparable distribution 
of dideoxy- terminated fragments with the relatively non- 
discriminating enzyme (i.e. the one that contains 
tyrosine at the critical position) gives a factor that 

10 corresponds to the difference in efficiency between the 
two DNA polymerases in their use of ddNTPs relative to 
the comparable dNTP. This factor was multiplied by the 
absolute ratios obtained for T7 DNA polymerase, E. coli 
DNA polymerase I F762Y, and Taq DNA polymerase F667Y in 

15 Example 6 in order to obtain the values shown below for 
T7 DNA polymerase Y526F, E. coli DNA polymerase I and Taq 
DNA polymerase, respectively. 
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Polymerase 



T7 DNA 
polymerase 



Incorporation Race Ratios 
dG/ddG dA/ddA dT/ddT 



3.2 



3.3 



2.8 



dC/ddC 



3.7 



T7 DNA 
polymerase 
Y526F 



6,400 



7,300 



8,400 



11, 000 



E. coli DNA 
polymerase I 



140 



720 



1, 100 



250 



E. coli DNA 
polymerase I 
F762Y 



0.56 



0.72 



0 .54 



0.75 



Taq DNA 
polymerase 



1,400 



4, 700 



4,500 



2, 600 



Taq DNA 

polymerase F667Y 



0.45 



0.59 



0 .56 



0 .32 



The Table below summarizes the effect of having 
tyrosine in place of phenylalanine at the critical 
selectivity residue of T7 DNA polymerase, E. coli DNA 
polymerase I and Taq DNA polymerase on the use of ddNTPs 
relative to dNTPs. 



Residue 



Average Rate 



Improvement 
in dN/ddN 



use of 
ddNTPs 



T7 DNA 
polymerase 



Tyrosine (WT) 
Phenylalanine 



3 .0 
8 , 000 



3,000 X 



E. coli DNA Phenylalanine (WT) 600 
polymerase I Tyrosine 0 . 6 



1000 X 
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Taq DNA Phenylalanine (WT) 3,000 

polymerase Tyrosine 0.5 6,000 X 



To use this test to determine the extent of 
discrimination of a new DNA polymerase, reactions would 
5 be carried out as described above initially using a wide 
range of ratios of ddNTPs to dNTPs, and comparing the 
distribution of dideoxy- terminated fragments on a 
denaturing polyacrylamide gel to those of a standard, 
e.g. T7 DNA polymerase. Matching the lanes that have 

10 comparable average lengths of DNA fragments, the ratio of 
ddNTPs :dNTPs of the new DNA polymerase is divided by the 
ratio used with T7 DNA polymerase to give the level of 
discrimination against ddNTPs by the new DNA polymerase 
relative to T7 DNA polymerase. 

15 To use this test to determine whether the 

modification of a DNA polymerase results in a decrease in 
its ability to discriminate against ddNTPs (i.e. 
incorporate dideoxynucleotides more efficiently) , an 
identical number of units of modified and unmodified DNA 

20 polymerases would be used in a series of reactions 
containing varying ratios of ddNTPs to dNTPs as described 
above. The average length of dideoxy- terminated 

fragments are compared for identical ratios of ddNTPs to 
dNTPs for the two enzymes. If the modification has 

25 resulted in a DNA polymerase that incorporates 
dideoxynucleotides more efficiently, the average length 
of dideoxy-terminated fragments will be shorter for 
reactions using the modified DNA polymerase compared with 
those using the unmodified DNA polymerase at the same 

30 ratios of ddNTP to dNTP, while the average length will be 
longer for reactions using the modified DNA polymerase if 
the modification resulted in a DNA polymerase that is 
more discriminatory towards ddNTPs. 

This test can also be used to determine whether a 

35 modification of a DNA polymerase results in a decrease in 
its ability to discriminates against analogs of ddNTPs, 
for example fluorescent ly tagged ddNTPs. This is 
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possible even if one does not know the concentration of 
the analogs being tested. As an example of this, we 
compared the ability of Taq DNA polymerase and Tag DNA 
polymerase F667Y to use each of the four DyeDeoxy 
5 Terminators manufactured by Applied Biosystems (part 
number 401150) . These DyeDeoxy Terminators have four 
different fluorescent moieties covalently bound to each 
of the four ddNTPs (see Example 12 for more detail) . For 
each of the DyeDeoxy Terminators, the ratio of dNTPs to 

10 DyeDeoxy Terminators was varied over a 16,000 fold range 
by intervals of two- fold, and the pattern of dideoxy- 
terminated fragments was compared on the autoradiograph 
to determine the ratios required for each of the two 
enzymes to obtain the same average length of dideoxy- 

15 terminated fragments. The Table below summarizes these 
results. For each Terminator, the column labeled "Ratio" 
represents the ratio of the ratios of ddNTP to dNTP 
required to give fragments of identical average length 
for Taq DNA polymerase versus Taq DNA polymerase F6 67y. 

20 As with normal ddNTPs, Taq DNA polymerase F6e7Y 
incorporates the fluorescent ddNTP derivatives much more 
efficiently that does the unmodified Taq DNA polymerase, 
by at least a factor of 400. 

25 DyeDeoxy Terminator Ratio 
~ G Terminator >4 00 

A Terminator >2,000 

T Terminator >2,000 

C Terminator >2,000 
30 . 

As discussed previously, one complication that can 
arise in the use of this test is when the DNA polymerase 
being tested has an associated exonuclease activity. The 
3 5 problems that 5' to 3 ' and 3' to 5' exonucleases can 
cause and ways to minimize their effects when present 
are discussed in Example 6. When testing to determine 
whether the modification of a polymerase decreases its 
ability to incorporate dideoxynucleotides , one class of 
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mutants that can have this effect are ones that 
inactivate a normally very active 3' to 5 ' exonuclease 
activity {see for example Tabor and Richardson 84, Proc . 
Natl. Acad. Sci. 4767, 1987) . This class of mutants are 
5 not claimed in this patent. If one has a modified DNA 
polymerase that gives an apparent increase in the ability 
of the DNA polymerase to incorporate dideoxynucleo tides, 
and one wants to determine whether it is in the 
polymerase domain or the exonuclease domain, it is 

10 necessary to carry out an iexonuclease assay on the 
modified and unmodified forms of the enzyme; a mutation 
that affects primarily the exonuclease activity of the 
enzyme will have a greater effect on the exonuclease 
activity of the enzyme than on the polymerase activity . 

15 Preferably, one would measure the exonuclease activity on 
a DNA substrate labeled at its 3' end with ^^p-dd^Mp (see 
Example 21) . As in Example 6, it is important to inhibit 
pyrophosphorolysis in these reactions in order to avoid 
it increasing the apparent discrimination against ddNTPs 

20 by a DNA polymerase- This is readily accomplished by 
including pyrophosphatase in the reaction. 

Example 8. Determination tJlfi efficiency OL 

inriorporation of dideoxynucleotides bv inhibition of DNA 

25 syntihesis on a single- stranded M13 DNA - unlabeled 4Q-mer 
pr-imer complex 

In this example the sensitivity of a DNA polymerase 
to a ddNTP is determined by measuring the ability of 
varying concentrations of the ddNTP to inhibit a standard 

3 0 DNA synthesis reaction. The DNA synthesis assay is a 
modification of that described in Tabor and Richardson 
264 vJ. Biol. Chem. 6447, 1989. The 40-mer primer and 
the M13 mGPl-2 template are as described in Example 3. 
The primer is annealed to the M13 mGPl-2 single-stranded 

3 5 DNA template in a reaction mixture (IX = 25 1) 
containing 2 g. of M13 mGPl-2 DNA, 6 pmoles of primer (a 
10-fold molar excess to template) , 40 mM Tris HCl, pH 
8.0, 10 mM MgClj and 100 mM NaCl . The mixture is 
incubated 6 5 °C for 2 min and then cooled to room 
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temperature over 3 0 min. The standard reaction mixture 
(45 1) contained 22 mM TrisHCl, pH 8,0, 5,5 mM MgCla, 
55 mM NaCl, 300 M dGTP, dATP, dCTP and [^HJTTP (30 
cpm/pmol) , and varying concentrations of one of the four 
5 or all four ddNTPs . The reaction mixtures also contained 
10 ng of yeast inorganic pyrophosphatase to inhibit 
pyrophosphorolysis that could otherwise increase the 
apparent discrimination by the DNA polymerase (Tabor and 
Richardson 265 J. Biol. Chem. 8322, 1990) . The mixtures 

10 are incubated at 37 for 1 min (70»C for thermophilic 
DNA polymerases) , and the reactions are initiated by the 
addition of 5 1 aliquots of dilutions (0.01 to 1 unit) 
of the DNA polymerase being analyzed, diluted in 20 mM 
TrisHCl, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% 

15 bovine serum albumin. The reaction mixtures are further 
incubated at 37*'C-for 10 min (70*=»C for thermophilic DNA 
polymerases) - The reactions are terminated by the 
addition of 5 1 of 100 mM EDTA, and 45 1 is spotted 
onto Whatman DE81 filter discs. The discs are washed in 

2 0 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0, 
followed by 2 changes of 150 ml of 90% ethanol, each of 
5-10 min duration. The disks are then dried under a heat 
lamp and counted in a scintillation counter in the 
presence of 5 ml of fluor (Opti-Fluor O, Packard) , From 

2 5 the amount of radioactivity on each disk, the amount of 

total DNA synthesis is calculated. 

Specific DNA polymerases being tested may have 
optimum buffer, pH, salt, or temperature conditions that 
differ from those suggested above. Each DNA polymerase 

3 0 should be tested under the conditions that give optimum 

specific polymerase activity for that enzyme. 

To determine whether a modification of a given DNA 
polymerase results in a decrease in its ability to 
discriminate against dideoxynucleotides , first a series 
35 of reactions are carried out in the absence of ddNTPs, 
varying the DNA polymerase concentration to determine the 
range where the activity varies approximately linearly 
with enzyme concentration for both the modified and 
unmodified forms of the enzyme. An enzyme concentration 
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is chosen that is in this linear range for both forms of 
the enzyme; e.g., an enzyme concentration whereby about 
30% of the template is replicated in the 10 min reaction 
is likely to be in such a linear range . 
5 Once a proper enzyme concentration is chosen, a 

series of reactions are carried out varying the amount of 
either one ddNTP or preferably all four ddNTPs in the 
mixture, in order to determine the concentration required 
CO inhibit 50% of the DNA synthesis. For example, under 
10 the conditions stated above (3 00 M 4dNTPs) , the 
following concentrations of a mixture of 4 ddNTPs are 
required to inhibit 50% of the DNA synthesis for the 
following six DNA polymerases: 



Polymerase [4ddNTP] for 50% inhibition 

T7 DNA polymerase 0.1 ;xM 

T7 DNA polymerase Y526F 3 00 fiM 

20 fM 
0,04 fiM 

150 AtM 
0.4 fiM 



This test can be used to determine if a modification 
in a new DNA polymerase results in a decrease in its 
5 ability to discriminate against ddNTPs; if the mutation 
does have this effect, then a higher concentration of 
4ddNTPs will be required to inhibit 50% of the DNA 
synthesis in the assay described above. 

10 Example 9. Determination of t:he efficiency of 

incorporation of dideoxynucleot ides by measurina tlLS. 

incorporation of ffl-^^PldAMP int<? Synthetic primer- 

templatg> complexes 



E. coli DNA polymerase I 

E. coli DNA polymerase I F762Y 



Taq DNA polymerase 

Taq DNA polymerase F667Y 
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In this example the competition between a dNTP and a 
ddNTP is assayed for use at a single site in a synthetic 
primer- template . This assay differs from the others in 
that it limits the comparison of the use of the two 
5 substrates to a single site, avoiding the complication of 
sequence-specific variability in discrimination. While 
this relatively simple assay is suitable for a 
preliminary screen of DNA polymerases for their ability 
to discriminate against ddNTPs, it should not be used to 

10 the exclusion of the assays presented in Examples 6-8 
since often the discrimination against ddNTPs is strongly 
influenced by the neighboring sequences, an important 
problem for DNA sequence analysis (see for example Tabor 
and Richardson 265 i7. Biol. Chem. 8322, 1990). 

15 The two primer- templates shown below are used in this 

example. The first is used to determine the 

discrimination between dATP versus ddATP, while the 
second is used to determine the discrimination between 
dCTP versus ddCTP, dTTP versus ddTTP, and dGTP versus 

20 ddGTP. 

Primer-Template A: 

5 " GGCGACGTTGTAAAACGACGGCCAGTGCCA 3 ' 

25 

3 ' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5 ' 
Primer-Template B: 
3 0 5* GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 

3 » GCTGCAACATTTTGCTGCCGGTCACGGTCAGTTTT 5 ' 

35 Each reaction mixture contains 25 pmoles each of 

primer and template. The primer and template are mixed 
together and annealed in a reaction mixture (IX = 10 1) 
containing 40 mM Tris*HCl, pH 8.0, 10 mM MgClj and 100 mM 
NaCl . The mixture is incubated 65 °C for 2 min and then 
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cooled to room temperature over 30 min. The standard 
reaction mixture (45 1) for reactions carried out with 
Primer-Template A contains 22 mM Tris;HCl, pH 8.0, 5 . 5 mM 
MgCla, 55 mM NaCl, 25 pmoles of the Primer-Template A 
5 complex, 5 M [a-"p]dGTP (4,000 cpm/pmole) and varying 
concentrations of dATP and ddATP. The reaction mixtures 
also contained 10 ng of yeast inorganic pyrophosphatase 
to inhibit pyrophosphoro lysis that could otherwise 
increase the apparent discrimination by the DNA 
10 polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322, 
1990) . The mixtures are incubated at 37oc for 1 min 
(70^C for thermophilic DNA polymerases) , and the 
reactions are initiated by the addition of 5 1 aliquots 
(0.01 to 1 unit) of the DNA polymerase being analyzed 
15 diluted in 20 mM TrisHCl, pH 7,5, 10 mM 2- 
mercaptoethanol and 0.05% bovine serum albumin. The 
reaction mixtures are further incubated at 37«C for 10 
min (70°C for thermophilic DNA polymerases) . The 
reactions are terminated by the addition of 5 1 of 100 
20 mM EDTA, and 45 1 is spotted onto Whatman DE81 filter 
discs. The discs are washed in 4 changes of 150 ml of 
0.3 M ammonium formate, p H 8.0, followed by 2 changes of 
150 ml of 90% ethanol, each of 5-10 min duration. The 
disks are then dried under a heat lamp and counted in a 
25 scintillation counter in 5 ml of fluor (Opti-Fluor O, 
Packard) . From the amount of radioactivity on each disk, 
the amount [^''PldGMP incorporated was determined. The 
assumption is made once a single dTU^P residue has been 
incorporated to remove the block for the incorporation of 
30 dGMP residues, four [^^PjdGMPs will be incorporated into 
each primer, and thus the number of dAMPs incorporated 
are one fourth the number of dGMPs incorporated. 

All reactions are carried out with a constant amount 
of the DNA polymerase being analyzed; the amount of DNA 
35 polymerase should be sufficient to replicate 50% of the 
total dCMP residues in the single -stranded region of the 
template in the 10 min incubation in the presence of 10 
M dATP and the absence of ddATP. Specific DNA 
polymerases being tested may have optimum buffer, pH, 
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salt, or temperature conditions that differ from those 
suggested above. Each DNA polymerase should be tested 
under the conditions that give optimum specific 
polymerase activity for that enzyme. Control reactions 
5 should also be carried out in which neither dATP nor 
ddATP are present; this defines the background DNA 
synthesis that should be subtracted from each sample. In 
general this is less than 10% of the DNA synthesis 
obtained when dATP is present. 

10 Reactions are then carried out with 10 M dATP and 

varying concentrations of ddATP, to determine the amount 
of ddATP required to inhibit DNA synthesis by 50%- 
Examples are shown in the Table below for the 
concentration of ddATP required to inhibit 50% of the DNA 

15 synthesis in the presence of 10 M dATP. The polymerases 
are defined as in Example 6. 



Polymerase [ddATP] for 50% inhibition 

20 

T7 DNA polymerase 30 

T7 DNA polymerase Y526F >500 /iM 

E. coli DNA polymerase I >50 0 

25 ,E. coli DNA polymerase I F762Y 6 fiM 

Taq DNA polymerase >500 

Taq DNA polymerase F667Y 5 mM 



30 

In order to carry out an analogous test measuring the 
discrimination against ddGTP, ddTTP or ddCTP, reactions 
are carried out identical to that described above except 
that Primer-Template B is used instead of Primer-Template 
35 A, and the reactions containing 10 M dGTP, dTTP and dCTP 
and 5 M [a -"P] dATP (4, 000 cpm/pmole) and varying 
concentrations of either ddGTP, ddTTP or ddCTP . 

As with the other examples, DNA polymerases with a 3' 
to 5' exonuclease activity can interfere . with this assay, 
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making an enzyme more discriminatory against ddNTPs than 
that due to discrimination at the level of incorporation 
of the analog. In addition, enzymes, with high levels of 
exonuclease activity can use up all the dNTPs in the 
5 reaction mixture (especially with the relatively low 
level of dNTPs present in these reactions) , resulting in 
no net DNA synthesis (e.g. the native T7 DNA polymerase, 
see Tabor and Richardson 264 J. Biol. Chem. 6447, 1989). 
In these cases the concentration of DNA polymerase and 
10 the incubation time of the reaction should be adjusted to 
obtain the maximum level of DNA synthesis in the absence 
of ddNTPs. 

E xamp l e IQ , — Determination of th^:^ effinT^nf-y '^^ 

15 i ncnrporfirlon — fa-"PldfiNMPs into a svni-h^h-i.^ p^-in,.>^- 

temolate c;omp1«>y 

In this example the competition between a dNMP and a 
ddNMP is assayed for incorporation at a single site in a 
synthetic primer- template . This assay differs from that 

20 in Example 9 in that the label is in Ia-"P]ddATP, and 
thus the incorporation of ddAMP is being measured. This 
assay can be used to test whether a ddNTP is inhibiting 
a DNA polymerase by acting as a chain terminator, being 
incorporated into the 3' end of the primer, or simply by 

2 5 binding the DNA polymerase and preventing further DNA 
synthesis without actually being incorporated into the 
primer . 

In the example below the incorporation of ddAMP is 
measured using [a-"P]ddATP and Primer-Template A (Example 
30 9) : 

Primer -Template A: 

5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 
3 ' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5 ' 



35 



Incorporation of [a-"P]ddGMP, [a-^?>]ddCMP and [a- 
"PlddTMP could be similarly tested on the appropriate 
template (for example, Primer-Template B in Example 9> . 
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Each reaction mixture contains 25 pmoles each of 
primer and template (Primer-Template A, see above) . The 
primer and template are mixed together and. annealed in a 
reaction mixture (IX = 10 1) containing 40 mM Tris HCl, 
5 pH 8.0, 10 mM MgClj and 100 mM NaCl . The mixture is 
incubated 65 °C for 2 min and then cooled to room 
temperature over 3 0 min. The standard reaction mixture 
(45. 1) contains 22 mM Tris'HCl, pH 8.0, 5 , 5 mM MgClj, 55 
mM NaCl, 25 pmoles of the Primer-Template A complex, 2.5 
10 M [a-32p]d^Tp (Amersham PB10235, >5000 Ci/mmol diluted 
with cold ddATP to a specific activity of 4,000 
cpm/pmole) and varying concentrations of dATP. The 
reaction mixtures also contained 10 ng of yeast inorganic 
pyrophosphatase to inhibit pyrophosphorolysis that could 
15 otherwise increase the apparent discrimination by the DNA 
polymerase (Tabor and Richardson 265 J*. Biol. Chem, 8322, 
1990) . The mixtures are incubated at 37®C for 1 min 
(70^C for thermophilic DNA polymerases) , and the 
reactions are initiated by the addition of 5 1 aliquots 
20 (0.01 to 1 unit) of the DNA polymerase being analyzed 
diluted in 20 mM Tris-HCl, pH 7.5, 10 mM 2- 
mercaptoethanol and 0.05% bovine serum albumin. The 
reaction mixtures are further incubated at 3 7*^C for 10 
min (70**C for thermophilic . DNA polymerases) . The 
25 reactions are terminated by the addition of 5 1 of 100 
mM EDTA, and 4 5 1 is spotted onto Whatman DE81 filter 
discs. The discs are washed in 4 changes of 150 ml of 
0.3 M ammonium formate, p H 8.0, followed by 2 changes of 
150 ml of 90% ethanol, each of 5-10 min duration. The 
3 0 disks are then dried under a heat lamp and counted in a 
scintillation counter 5 ml of fluor (Opti-Fluor O, 
Packard) . From the amount of radioactivity on each disk, 
the amount [^^P] ddAMP incorporated is determined. 

All reactions are carried out with a constant amount 
35 of the DNA polymerase being tested; the amount of DNA 
polymerase should be that concentration which gives the 
highest level of incorporation of ["P]ddAMP into Primer- 
Template A in the 10 min incubation in the absence of 
dATP. Specific DNA polymerases being tested may have 
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optimum buffer, pH, salt, or temperature conditions that 
differ from those suggested above. Each DNA polymerase 
should be tested under the conditions that give optimum 
specific polymerase activity for that enzyme. 
5 In order to use this assay to determine the level of 

discrimination against a ddNTP, reactions are carried out 
with a constant amount of the DNA polymerase and 
["P]ddATP, in the presence or absence of 2 . 5 M dATP (an 
equimolar concentration to [^^PjddATP) and the effect the 

10 presence of dATP has on the incorporation of ["P] ddAMP 
is determined. If a DNA polymerase does not discriminate 
between the incorporation of ddAMP and dAMP, and it has 
no 3' to 5' exonuclease activity, then the addition of 
dATP will inhibit incorporation of ["P] ddAMP by 50%. 

15 This test is best for a DNA polymerase that 

efficiently incorporates ddNMPs, such as Taq DNA 
polymerase F667Y. For DNA polymerases that discriminate 
strongly against ddNMPs, the previous assay is preferred, 
in which the label is in a dNTP other than the one being 

20 used in competition with the ddNTP, since in that case 
much higher concentrations of the ddNTP can be used. 
However, with DNA polymerases that discriminate strongly 
against ddNMPs, if one is interested in testing whether 
a given mutation is reducing the level of discrimination 

25 against ddNMPs, this assay could be used by assaying the 
unmodified DNA polymerase on this substrate in the 
absence of dATP (measuring the incorporation of [^^P] ddAMP 
as a function of DNA polymerase concentration) , and 
comparing the rate of incorporation to that of the mutant 

30 enzyme. If the mutation is reducing the discrimination 
against ddATP, then the mutant enzyme should have a 
higher specific activity for incorporation of [^^P] ddAMP. 

As with the other examples, DNA polymerases with a 3' 
to 5' exonuclease activity can interfere with this assay, 

3 5 making an enzyme more discriminatory against ddNTPs than 
chat due to discrimination at the level of incorporation 
of the analog. And as in Example 9, enzymes with high 
levels of exonuclease activity can deplete all the dNTPs, 
resulting in no net incorporation of [^^P] ddAMP, In 
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these cases the concentration of DNA polymerase and the 
incubation time of the reaction should be adjusted to 
obtain the maximum level of incorporation of [^^p] ddAMP 
by the DNA polymerase being tested. 
5 All of the above methods are based on radioactivity 

to detect either the length of the extended primer or the 
amount of DNA synthesis on the primer. The efficiency of 
incorporation of dideoxynucleotides by a DNA polymerase 
can also be measured nonradioactively . Two examples are 
10 presented below that make use of either fluorescent 
primers or fluorescent dye-dideoxy terminators that are 
detected on an Applied Biosystems Model 373A DNA 
Sequencing System. 

15 Example 12^ Petermlnat jnn of t:he efficiency of 

incorporation of dideoxynucleotides using a fluorescent 

pr i mer annealed to sinale-st-randed DNA and gel 

electrophoresis 

In this example a f luorescently labeled primer is 

20 annealed to single-stranded DNA and DNA synthesis 
reactions are carried out using vaarying ratios of dNTPs 
to ddNTPs . The samples are then loaded onto an Applied 
Biosystems Model 3 73 A DNA Sequencing System, and the 
length of each fluorescent fragment is determined by 

25 direct fluorescent detection during gel electrophoresis. 
Reactions are carried out as described in Tabor and 
Richardson 265 J. Biol. Chem, 8322, 1990. The 
fluorescent primer used is ""Fam" primer (Applied 
Biosystems). The DNA used is single- stranded M13 mGPl-2 

3 0 DNA as described in Example 3. The primer is annealed 
CO the M13 mGPl-2 single -stranded DNA template in a 
reaction mixture (IX = 10 1) containing 2 g of M13 
mGPl-2 DNA, 5 ng of primer, 4 0 mM TrisHCl, pH 8.0, 10 mM 
MgCla and 100 mM NaCl . The mixture is incubated 65 °C 

3 5 for 2 min and then cooled to room temperature over 3 0 
min. The standard reaction mixture (18 1) contains 22 
mM TrisHCl, pH 8.0, 5 . 5 mM MgClj, 55 mM NaCl, and varying 
concentrations of the 4dNTPs and one of the four ddNTPs. 
The reaction mixtures also contained 10 ng of yeast 
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inorganic pyrophosphatase to inhibit pyrophosphorolysis 
that could otherwise increase the apparent discrimination 
by the DNA polymerase (Tabor and Richardson 265 J. Biol. 
Chew. 8322, 1990) . The mixtures are incubated at 37 «C 
5 for 1 min (70<'C for thermophilic DNA polymerases) , and 
the reactions are initiated by the addition of 2 1 
aliquots (0.01 to 1 unit) of the DNA polymerase being 
analyzed diluted in 20 mM TrisHCl, pH 7.5, 10 mM 2- 
mercaptoethanol and 0.05% bovine serum albumin. The 
reaction mixtures are further incubated at 37«>C for 10 
min (70°C for thermophilic DNA polymerases) . The 
reactions are by the addition of 8 1 of 20 mM DTPA, 1 
M potassium acetate, pH 5.0, and 60 1 of ethanol. After 
centrifugation, the DNA. is resuspended in 6 1 of 80% 
15 formamide, 10 mM TrisHCl, pH 8.0, and 1 mM EDTA, and 
heated at 80 °C for 2 min immediately prior to loading on 
the polyacrylamide gel on the Applied Biosystems 373A DNA 
Sequencing System following the manufacturer's 
procedures . 

20 Specific DNA polymerases being tested may have 

optimum buffer, pH, salt, or temperature conditions that 
differ from those suggested above. Each DNA polymerase 
should be tested under the conditions that give optimum 
specific polymerase activity for that enzyme. The 
25 concentration of DNA polymerase should be sufficient to 
extend most of the primers at least several hundred 
nucleotides or until a dideoxynucleotide has been 
incorporated in the 10 min reaction. 

The ratio of dNTPs to ddNTPs is adjusted to obtain 
3 0 optimum peak intensities for approximately three hundred 
bases. For example, for Tag DNA polymerase approximately 
10 M 4dNTPs and 200-600 M ddNTPs is optimal, while for 
Tag DNA polymerase F667Y approximately 3 00 M 4 dNTPs and 
0.5-5 M ddNTPs is optimal. 
35 To determine whether a modification of a given DNA 

polymerase results in a decrease in its ability to 
discriminate against dideoxynucleocides, reactions should 
be carried out at varying ratios of dNTPs to ddNTPs for 
both the unmodified and modified DNA polymerases, and the 
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intensities of dideoxy- terminated fragments of different 
lengths are compared to determine whether the modified 
DNA polymerase is using ddNTPs more efficiently than the 
unmodified enzyme. 

5 

Example IZ. Determination of the efficiency of 

incorporation of fluorescent dldeoxynu cleot ides by gel 

electrophpresis 

In this example a nonf luorescent primer is annealed 

10 to single-stranded DNA and DNA synthesis reactions are 
carried out using varying ratios of dNTPs to a single 
f luorescently labeled-ddNTP . The samples are then loaded 
onto an Applied Biosystems Model 373A DNA Sequencing 
System, and the length of each fluorescent fragment is 

15 determined by direct fluorescent detection during gel 
electrophoresis. The primer used in this example is the 
40-mer described in Example 3, and the template is the 
single-stranded M13 mGPl-2 described in Example 3. The 
primer is annealed to the M13 mGPl-2 single -stranded DNA 

20 template in a reaction mixture (IX = 10 1) containing 2 
g of M13 mGPl-2 DNA, 6 pmoles of primer (a 10-fold molar 
excess to template), 40 mM TrisHCl, pH 8.0, 10 mM MgCli 
and 100 mM NaCl . The mixture is incubated 65 ®C for 2 
min and then cooled to room temperature over 30 min. The 

25 standard reaction mixture (18 1) contains 22 mM 
Tris HCl, pH 8.0, 5.5 mM MgClj, 55 mM NaCl, and varying 
concentrations of the 4dNTPs plus one of the four 
f luorescently- labeled ddNTPs. The four f luorescently 
labeled ddNTPs are from Applied Biosystems (Taq DyeDeoxy 

3 0 Teirminator Cycle Qequencing Kit, part number 4 01150) , and 
are referred to as G, A, T or C "DyeDeoxy Terminators" 
(manual for Taq DyeDeoxy Terminator Cycle Sequencing Kit, 
part number 901497, Rev. E) . The reaction mixtures also 
contained 10 ng of yeast inorganic pyrophosphatase to 

3 5 inhibit pyrophosphorolysis that could otherwise increase 
the apparent discrimination by the DNA polymerase (Tabor 
and Richardson 265 J. Biol. Chem. 8322, 1990). The 
mixtures are incubated at 37*'C for 1 min (70**C for 
thermophilic DNA polymerases) , and the reactions are 
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initiated by the addition of 2 1 aliquots (0.01 to 1 
unit) of the DNA polymerase being analyzed diluted in 20 
mM Tris-HCl, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% 
bovine serum albumin. The reaction mixtures are further 
5 incubated at 37<>C for 10 min (70°C for thermophilic DNA 
polymerases) . The reactions are by the addition of 8 
1 of 2 0 mM EDTA, 1 M potassium acetate, pH 5.0, and 6 0 
1 of ethanol. After centrif ugation, the DNA is 
resuspended in 6 1 of 80% formamide, 10 mM Tris HCl, pH 
10 8.0, and 1 mM DTPA, and heated at 80 for 2 min 

immediately prior to loading on the polyacrylamide gel on 
the -Applied Biosystems 373A DNA Sequencing System 
following the manufacturer's procedures . 

Specific DNA polymerases being tested may have 
15 optimum buffer, pH, salt, or temperature conditions that 
differ from those suggested above. Each DNA polymerase 
should be tested under the conditions that give optimum 
specific polymerase activity for that enzyme. The 
concentration of DNA polymerase used in these reactions 
2 0 should be that concentration that is sufficient to extend 
most of the primers at least several hxindred nucleotides 
or until a dideoxynucleotide has been incorporated in the 
10 min reaction. For DNA polymerases that have a 5 ' to 
3' exonuclease activity, such as Taq DNA polymerase, the 
25 DNA polymerase concentration must be kept low enough to 
avoid this activity degrading a significant percentage of 
the 5' ends of the fragments. 

To determine whether a DNh polymerase discriminates 
strongly or weakly against a fluorescent ddNTP, reactions 
30 are carried out using 20 M 4dNTPs and 0.01 1 of each 
DyeDeoxy Terminator provided by Applied Biosystems (part 
number 4 01150) . When Taq DNA polymerase is used under 
these conditions, most of the fluorescence is either in 
unincorporated dye-ddNTPs at the leading front of the 
35 gel, or in fragments greater than several hundred bases 
in length. In contrast, with the Taq DNA polymerase 
mutant F667Y, under these conditions mosc of the 
fluorescence is in fragments than are less than several 
hundred bases in length, and a significantly lower 
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percentage of the total fluorescence is in unincorporated 
dye-ddNTPs at the leading front of the gel. 

To determine whether a modification of a given DNA 
polymerase results in a decrease in its ability to 
5 discriminate against dideoxynucleotides , reactions are 
carried out at varying ratios of dNTPs to DyeDeoxy 
Terminators for both the unmodified and modified DNA 
polymerases, and the average length of the resulting 
fluorescent fragments are compared to determine whether 

10 the modified DNA polymerase is using the DyeDeoxy 
Terminators more efficiently than the unmodified enzyme. 

The following examples provide tests for determining 
the uniformity of band intensities produced from dideoxy- 
terminated fragments synthesized by different DNA 

15 polymerases. 

Example 13 ■ Determination of uniformity of incorporation 

of dideoxynucleotides using a single- st randed M13 DNA - 
5J. "P- labeled 4 0-mer primer complex and g£Ll 

20 electrophoresis 

In this example the uniformity of dideoxynucleotide 
incorporation is measured on a 5' ^^P-end labeled primer 
extended on a single-stranded M13 DNA template. Three 
activities can cause variability in band intensity of 

25 dideoxy-terminated fragments. One is exonuclease 

activity that can preferentially at some sequences; this 
is avoided by removal of the activity selectively either 
by chemical or genetic means (see for example Tabor and 
Richardson 264, J. Biol. Chem. 6447, 1989). The second 

3 0 is pyrophosphorolysis ; this is readily avoided by 
including pyrophosphatase in the reaction mixture, which 
degrades the pyrophosphate that accumulates- during DNA 
synthesis and is a necessary substrate for 
pyrophosphorolysis. The third is sequence-specific 

35 variability in the incorporation of dideoxynucleotides. 
Variability in band intensity is detrimental to DNA 
sequence analysis, decreasing the accuracy of the DNA 
sequence determined. This test is designed to compare 
the degree of variability in band intensities in 
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fragments synthesized by different DNA polymerases,. 
including mutant DNA polymerases that may incorporate 
dideoxynucleotides more efficiently. 

The primer, template and reaction conditions in this 
5 Example are identical to that described in Examples 6 and 
7. The template is M13 mGPl-2 single -stranded DNA 
described in Example 3, and the primer is the 40 mer also 
described in Example 3. Reaction conditions used are 
those that are optimum for the DNA polymerase being 
10 tested with regard to buffer, pH, salts, and temperature 
of the reaction. It is preferred that magnesium is the 
only metal ion present in the reaction mixture (i.e. the 
reactions are carried out in the absence of added 
manganese ions) . A concentration of DNA polymerase is 

15 chosen whereby most of the primers are extended in a 10 
min reaction and are terminated by the incorporation of 
a dideoxynucleotide . The ratios of dNTPs to ddNTPs are 
adjusted for the specific DNA polymerase being tested so 
that the average fragment size is approximately 100-300 

20 nucleotides. ddCTP is the preferred ddNTP to use for the 
test of uniformity since fragments terminated with this 
dideoxynucleotide tend to have the most variability in 
intensities (see for example Tabor and Richardson 86 
Proc, Natl. Acad. Sci. 4076, 1989). Gel electrophoresis, 

25 autoradiography, and analysis of the band intensities by 
either scanning of the gel or phosphoimager analysis are 
as described in Example 6 . Electrophoresis is carried 
out until fragments of approximately 55 nucleotides in 
length are at the bottom of the gel (the dye bromphenol 

30 blue has run of f the bottom of the gel and the dye xylene 
cyanol is approximately 8 cm from the bottom of the gel) . 

For a given series of ddNMP-terminated fragments, for 
example a series of ddCMP- terminated fragments, the 
intensities of the first 20 fragments from the bottom of 

35 the gel are determined, preferably by Phosphoimager 
analysis. Alternatively, the autoradiograph can be 
scanned by an imaging densitometer to determine the 
relative intensities of the first 20 fragments. These 
intensities are then analyzed statistically as described 
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in Example 6 in order to determine their variability. 
For example, the values can be plotted using the 
Macintosh program Kaleidograph Version 3.0 (Synergy 
Software) . The resulting plots are fit to an exponential 
5 decay curve using the Kaleidograph library routine for 
this function. R^, the correlation index for the data, 
is calculated by the Kaleidograph library routine. This 
is a measure of the variability in band intensities. The 
values obtained for using a new DNA polymerase are 

10 compared to those obtained using known DNA polymerases, 
for example _28 T7 DNA polymerase (Sequenase Version 2.0, 
United States Biochemical Corporation) in the presence of 
magnesium or manganese (see Tabor and Richardson 265 J. 
Biol. Chem. 8322, 1990), E. coli DNA polymerase I 

15 (either Klenow fragment or Klenow fragment with the 
mutation F762Y) or Taq DNA polymerase (either wild- type 
or the mutant F667Y) . The R^ values obtained with these 
known DNA polymerases are used as standards by which to 
compare a new DNA polymerase for its uniformity. 

20 

Example 14. Determination of uniformity of incomoration 

Qf ra-^^Pl ddNMPs using a single-stranded — E12 — DNa — n 

unlabeled primer complex and ael electrophoresis 

In this example the uniformity of dideoxynucleotide 

25 incorporation is measured using an unlabeled primer 
annealed to a single -stranded M13 DNA template and 
carrying out DNA synthesis in the presence of [a- 
^2p]ddATP. The test described in Example 13 is preferred 
over this one for measuring uniformity of 

30 dideoxynucleotide -terminated fragments, since it is more 
amenable to use of high concentrations of ddNTPs, which 
are required for use in enzymes that discriminate 
strongly against ddNTPs, such as E, coli DNA polymerase 
I, Taq DNA polymerase and T7 DNA polymerase y526F, The 

3 5 test in this example is most suited for use with enzymes 
that incorporate dideoxynucleotides efficiently, such as 
T7 DNA polymerase, E. coli DNA polymerase I F762Y and Taq 
DNA polymerase F667Y. 

The primer, template and general reaction conditions 
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in this example are similar to those described in Example 
8, with the following exceptions. The template is M13 
mGPl-2 single -stranded DNA described in Example 3, and 
the primer is. the 4 0 mer also described in Example 3. 
5 Reaction conditions used are those that are optimum for 
the DNA polymerase being tested with regard to buffer, 
pH, salts, and temperature of the reaction. It is 
preferred that magnesium is the only metal ion present in 
the reaction mixture (i.e. the reactions are carried out 

10 in the absence of added manganese ions) . The reactions 
are carried out with 50 M dGTP, dCTP and dTTP, and 
varying concentrations of dATP and ta-^^P] ddATP . The 
concentrations of dATP and [a-"P]ddATP are chosen to 
maximize the amount of radioactivity in fragments 

15 approximately 100 nucleotides in length. All other 
aspects with respect to gel electrophoresis and analysis 
of the radioactive fragments are as described in Example 
13 . 

20 Example 15, Determination q£ unifcrmity q£ incprporatipn 

of dideoxynucleotides using a single - stranded M13 DNA - 

fluorescentlv- labeled primer CQmpleX and 3^ 

electrophoresis 

In this example, reactions are carried out as 

25 described in Example 11. The template is M13 mGPl-2 
single-stranded DNA described in Example 3, and the 
primer is the 4 0 mer also described in Example 3. 
Reaction conditions used are those that are optimum for 
the DNA polymerase being tested with regard to buffer, 

30 pH, salts, and temperature of the reaction. It is 
preferred that magnesium is the only metal ion present in 
the reaction mixture (i.e. the reactions are carried out 
in the absence of added manganese ions) . A concentration 
of DNA polymerase is chosen whereby most of the primers 

3 5 are extended in a 10 min reaction and are terminated by 
the incorporation of a dideoxynucleotide , The ratios of 
dNTPs to ddNTPs are adjusted for the specific DNA 
polymerase being tested so that the average fragment size 
is approximately 100-200 nucleotides. ddCTP is the 
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preferred ddNTP to use since fragments terminated with 
this dideoxynucleotide tend to have the most variability 
in intensities (see for example Tabor and Richardson 86 
Proc. Natl. Acad. Sci, 4076, 1989). The intensities of 
5 up to the first 50 dideoxy- terminated fragments from the 
primer (approximately 200 nucleotides) are determined, 
and they are analyzed statistically as described in 
Example 13 . The correlation index is determined for 
the DNA polymerase being tested, and compared with that 

10 obtained with known DNA polymerases such as those 
described in Example 13 . Alternatively, the heights of 
the first 50 bands are determined and the ratio of 
heights of adjacent bands are calculated and used as a 
measure of variability; the maximum and mean of these 

15 ratios obtained from reactions carried out with the DNA 
polymerase being tested are compared with the values 
obtained for reactions carried out using known DNA 
polymerases such as those described in Example 13 . 

20 Example 16. Determination of - uniformity of incorporation 

of fluorescent dideoxynucleotides by ael electrophoresis 
In this example, reactions are carried out as 
described in Example 12 . . In order to determine the 
uniformity of incorporation of the DyeDeoxy Terminators 

25 for a specific DNA polymerase, the concentration of dNTPs 
and the specific DyeDeoxy Terminator are chosen to obtain 
f luorescently labeled fragments that average 100-200 
nucleotides in length. The intensity of fluorescence is 
determined for fragments 10 to 4 0 from the primer (the 

3 0 first 10 fragments near the f luorescently laibeled primer 
are ignored) . The fragments are analyzed statistically 
as described in Example 13, and the average variability 
is defined by R^, the correlation index for the data 
fitted to an exponential decay curve. The values 

3 5 obtained for R^ are compared with those obtained using 
known DNA polymerases as described in Example 13 . To 
determine if a specific mutation in a DNA polymerase is 
resulting in that DNA polymerase producing bands that 
have less variability, the R^ value obtained for the 
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mutant DNA polymerase * is compared to that obtained for 
the unmodified DNA polymerase. 

Example 17; — DNA sequence analysis using a DNA polymerase 
5 that efficiently incorporates dideoxynucl eotides 

DNA sequence analysis with a DNA polymerase of this 
invention is carried out using standard procedures, with 
the ratio of dNTPs to ddNTPs adjusted to obtain dideoxy- 
terminated fragments of an average length appropriate 

10 for separation by electrophoresis. For the mutant in the 
large fragment of E. coli DNA polymerase I, **Klenow 
fragment F762Y" , reactions are carried out essentially as 
with modified T7 DNA polymerase and described in Tabor 
and Richardson U. S. patent number 4,795,699, Tabor and 

15 Richardson 84, Proc, Natl. Acad. Sci . USA 4161, 1987, and 
SEQUENASE manual, "Step-By-Step Protocols For DNA 
Sequencing With SEQUENASE" 3rd Edition, United States 
Biochemical Corporation. Since Klenow fragment F762Y 
incorporates dideoxynucleotides approximately 5 times 

20 more efficiently than modified T7 DNA polymerase, the 
concentration of ddNTPs in the extension- termination 
mixtures should be reduced by a factor of five compared 
with the standard mixtures recommended for modified T7 
DNA polymerase (Sequenase manual, supra) . 

25 DNA sequence analysis with a thermostable DNA 

polymerase such as Taq DNA polymerase F667Y is as 
described by Innis et al . 85, Proc, Natl. Acad. Sci. USA 
943 6, 1988, with the following modification. Whereas 
Innis et al . recommend, dNTP/ddNTP ratios of 1:6 

30 dGTPrddGTP, 1:32 dATP:ddATP, 1:48 dTTP:ddTTP and 1:16 
dCTP:ddCTP, these ratios must be adjusted to take into 
account the 3,000- 8,000 fold more efficient use of the 
4 ddNTPs by the Taq DNA polymerase F667y compared with 
the wild- type Taq DNA polymerase. Thus the exnension- 

35 termination reactions with Taq DNA polymerase F667Y 
should contain 100 M 4dNTPs and 0.1-5 M each of the 
four ddNTPs; the exact amount: of each ddNTP should be 
adjusted based on the desired average fragment size for 
optimum determination of DNA sequence. All other aspects 
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of the DNA sequencing reactions and denaturing gel 
electrophoresis are as described in Innis et ai . (supra). 

Example 18: Cy^LLfi DNA sequence analysis using a 

5 thermgstable DNA PQlymerase that efficiently incorporates 

dideoxynucleotides 

Cycle DNA sequencing with a thermostable DNA 
polymerase such as Taq DNA polymerase F667Y is carried 
out as described in Carothers et al. 7 BioTechniques 4 94, 

10 1989 except that: (1) The four deoxy/dideoxy NTP mixes 
contain 250 M of all four dNTPs and 0.1-10 M of 
either ddGTP, ddATP, ddTTP or ddCTP, the exact amount of 
each ddNTP adjusted empirically based on the desired 
average fragment size for optimum determination of DNA 

15 sequence. . (2) Taq DNA polymerase F667Y is used in place 
of Taq DNA polymerase, using the same number of units of 
DNA polymerase as recommended by Carothers ,et al . 
(supra). (3) The reaction mixtures contain 10 ng of 
inorganic pyrophosphatase to inhibit pyrophosphorolysis 

20 that could otherwise increase the apparent discrimination 
by the DNA polymerase at specific sequences, reducing the 
uniformity of band intensities (Tabor and Richardson 265 
jf. Biol. Chem, 8322, 1990) . Preferably this 

pyrophosphatase is purified from a thermophilic organism, 

25 for example from Thermus thermophilus (Hohne et al . 47 
Biomed. Biochim. Acta 941, 1988) . 

Example 19. Automated nynl f> DNA sequencing UUSiUS 

mnd-ifjed Tag DNA polymerase and fluorescent primers 

30 Cycle DNA sequencing with a thermostable DNA 

polymerase such as Taq DNA polymerase F667Y and the 
Applied Biosystems Dye Primers is a modification of the 
procedure described in the Applied Biosystems manual 
(Part Number 901482,. Rev. B) . The procedure is 

35 identical to that described in the manual with the 
following modifications: (1) The dNTP/ddNTP mixes must 
be modified to take into account the more efficient use 
of ddNTPs by Taq DNA polymerase F667Y compared to Taq DNA 
polymerase. The new mixes that should be used in place 
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of the ones listed in the Applied Biosystems manual are 
as follows: 

dG/ddG mix = 100 M cMGTP, dATP, dTTP and dCTP, and 
5 0.05 M ddGTP 

dA/ddT mix = 100 M cMGTP, dATP, dTTP and dCTP, and 
0.3 M ddATP 

dT/ddT mix = 100 M cMGTP, dATP, dTTP and dCTP, and 
0.25 M ddTTP 

10 dC/ddC mix = 100 M cMGTP, dATP, dTTP and dCTP, and 

0.15 M ddCTP 

The concentrations of ddNTPs should be varied to 
optimize the intensity of fluorescence in fragments of 

15 specific size ranges, depending on the application. For 
example, increasing the concentration of ddNTPs will 
increase the fluorescence of fragments of shorter length. 
(2) Taq DNA polymerase F667Y is used in place of Taq DNA 
polymerase. The same number of units of enzyme are used 

2 0 in both cases, when assayed under standard DNA polymerase 
assay conditions. Alternatively, the same number of DNA 
polymerase molecules can be used. (3) The reaction 
mixtures contain 10 ng of inorganic pyrophosphatase to 
inhibit pyrophosphorolysis that could otherwise increase 

2 5 the apparent discrimination by the DNA polymerase at 

specific sequences, reducing the uniformity of band 
intensities (Tabor and Richardson 265 J. Biol. Chem. 
8322, 1990) . Preferably this pyrophosphatase is purified 
from a thermophilic organism, for example from ThBrmus 
30 thermophilus (Hohne et al . 47 Biomed. Biochim. Acta 941, 
1988) . All other aspects of the procedures are identical 
to that described in the Applied Biosystems manual 
{supra) . 

3 5 Example 20. Automate d rycle DNA f?equencinq USinq 

modified Tag DN A polymerase and f lUQCSSCenL dy£ 

tfirmina^Qrs 

Cycle DNA sequencing with a thermostable DNA 
polymerase such as Taq DNA polymerase F667Y and the 
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Applied Biosystems DyeDeoxy Terminators is a modification 
of the procedure in the Applied Biosystems manual (Part 
Number 9014 97, Rev. E) . The procedure is identical to 
that described in the manual with the following 
5 modifications: (1) The manual calls for the use of 1 1 
of each of the 4 DyeDeoxy Terminators undiluted in each 
sequencing reaction (20 1 reaction). In this Example 
the Terminators should be diluted prior to addition to 
the sequencing reaction mixture, since they are 
10 incorporated more than several hundred fold more 
efficiently with Taq DNA polymerase F667Y than with Taq 
DNA polymerase. 1 1 of each of the following dilutions 
are added to each sequencing reaction in place of 1 1 of 
the undiluted Terminator solutions : 

15 

DyeDeoxy G Terminator, 1 to 500 in HjO 
DyeDeoxy A Terminator, 1 to 1,500 in H2O 
DyeDeoxy T Terminator, 1 to 1,500 in HjO 
DyeDeoxy C Terminator, 1 to 1,000 in HjO 

20 

These dilutions are approximations only; the precise 
' dilution of each DyeDeoxy Terminator should be determined 
empirically depending on the number of bases of DNA 
sequence to be determined from the primer. (2) Taq DNA 

25 polymerase F667Y is used in place of Taq DNA polymerase. 
The same number of units of enzyme are used in both 
cases, when assayed under standard DNA polymerase assay 
conditions. Alternatively, the same number of DNA 
polymerase molecules can be used. (3) The reaction 

30 mixtures contain 10 ng of inorganic pyrophosphatase to 
inhibit pyrophosphorolysis that could otherwise increase 
the apparent discrimination by the DNA polymerase at 
specific sequences, reducing the unifoirmity of band 
intensities (Tabor and Richardson 265 J. Biol. Chem, 

35 8322, 1990) . Preferably this pyrophosphatase is purified 
from a thermophilic organism, for example from Thermus 
thermophilus (Hohne et ai . 47 Biomed. Biochiw. Acta 941, 
1988) , 
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Since this, procedure uses over 500 times less 
DyeDeoxy Terminators that previous procedures, there is 
much less of a problem with unincorporated DyeDeoxy 
Terminators after the reactions is complete. Thus it is 
5 not necessary to remove the unincorporated DyeDeoxy 
Terminators by .passing the sample over a spin column, as 
recommended in the Applied Biosystems manual (supra) . 
Rather, the sample can be precipitated with ethanol, 
taken up in 5 1 deionized formamide and 1 1 50 mM EDTA, 
10 pH 8.0, heated at 90 °C for 2 min, and loaded onto the 
Applied Biosystems 3 73A DNA Sequencing System according 
to the instructions in the 373A User's Manual. It is 
possible that with a DNA polymerase that incorporates the 
DyeDeoxy Terminators efficiently (such as Taq DNA 

15 polymerase F667Y) that concentration of the DNA 
sequencing reaction by precipitation with ethanol is not 
necessary; the DNA cycle sequencing reactions, carried 
out using preferably a high concentration of primer and 
dNTPs (see below) , can be terminated by the addition of 

20 an equal volume of deionized formamide, heated at 90 

for 2 min, and loaded immediately onto the Applied 
Biosystems 373A DNA Sequencing System. This represents 
a major savings in time to the researcher preparing the 
samples for DNA sequence determination. 

25 The procedure described above uses a relatively low 

concentration of dNTPs at the outset (7.5 M dATP, dTTP 
and dCTP and 37.5 M dITP) . The concentration of dNTPs 
decrease during the cycle DNA sequencing reaction as the 
dNTPs are used. This concentration of dNTPs (less than 

30 7.5 M) is less than optimal for maximum DNA polymerase 
activity for most DNA polymerases. This low 

concentration was necessary with previous protocols 
because the DNA polymerase that was used discriminated 
strongly against ddNTPs, requiring a high ratio of ddNTPs 

35 to dNTPs. The use of a DNA polymerase of this invention 
that discriminates much less against the DyeDeoxy 
Terminators now allows the use of higher concentrations 
of dNTPs. For example, in the protocol described above 
a 10-fold higher concentration of dNTPs and DyeDeoxy 
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Terminators could be used; i.e. 75 



M dATP, dTTP and 



dCTP, 375 M dITP, and the following dilutions of each 
of the four DyeDeoxy Terminators: DyeDeoxy G Terminator, 
1 to 50 in HjO; DyeDeoxy A Terminator, 1 to 150 in IJ O; 
5 DyeDeoxy T Terminator, 1 to 150 in HjO; DyeDeoxy C 
Terminator, 1 to 100 in HjO, Thus in this example the 
DyeDeoxy. Terminator concentrations are still at least 
lower than that in previous protocols by a factor of at 
least 50. 



ter-minated DNA substrat^e 

The 3* ["P] ddAMP-terminated DNA substrate is prepared 
by digesting native calf thymus DNA with Hindlll in a 

15 reaction mixture (50 1) that contains 10 g double- 
stranded calf thymus DNA, 4 0 mM Tris.HCl, pH 7 . 5 , 10 mM 
MgClj/ 5 raM dithiothreitol , 50 mM NaCl, 50 g/ml bovine 
serum albumin, and 10 units Hindlll. After incubation at 
37 for 60 min, 5 1 of [a-"P] ddATP (Amersham PB10235, 

20 >5000 Ci/mmol) and 5 units Sequenase Version 2.0 (United 
States Biochemical Corporation, catalog number 70775) are 
added, and the mixture is incubated at 20**C for 15 min. 
The reaction mixture is extracted once with an equal 
volume phenol : chloroform: isoamyl alcohol (24:24:1), and 

2 5 fractionated on a 1 ml column of Sephadex GlOO 

(Pharmacia) in 20 mM Tris HCl, pH 7-5, 2 mM EDTA, 100 mM 
NaCl- The 3' ^^P-labeled DNA, that elutes in the void 
volume, has a specific activity of approximately 500 
cpm/ng of total DNA. 
30 Reaction mixtures for exonuclease assays (90 1) 

contain 40 mM TrisHCl, pH 7.5, 10 mM MgClj, 10 mM 
dithiothreitol, 50 mM KCl and 1 nmol 3 • "P-labeled DNA. 
The reaction mixtures also contain 10 ng of yeast 
inorganic pyrophosphatase to remove trace amounts of 

3 5 pyrophosphate and thus prevent pyrophosphorolysis (Tabor 

and Richardson 265 j. Biol. Chew, 8322, 1990). This 
mixture is preincubated at 2 0 ''C for 1 min, then 10 1 of 
the appropriate enzyme dilution is added. After 
incubation at 37 oc for the indicated times, the 



10 



Example 21 



Exonucleasf^ assay using a F^^pl ddAMP 
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reactions are stopped by the addition of 3 0 1 of bovine 
serum albumin (10 mg/ml) and 3 0 1 of trichloracetic acid 
(100% w/v) . The precipitated DNA is incubated at 0 ^'C 
for 15 min, and then pelleted by centrif ugation at 12,000 
5 g for 3 0 min. The acid- soluble radioactivity is measured 
in 100 1 of the supernatant- One unit of 3* ["Pjdd/^P- 
DNA exonuclease activity catalyzes the acid 
solubilization of one pmol of [^^P]ddAMP in 15 min. 

Other embodiments are within the following claims. 
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) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME 

(B) STREET 

(C) CITY 

(D) STATE 

(E) COUNTRY 

(F) POSTAL CODE 



(ii) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE 

(B) STREET 



(C) CITY 

(D) COUNTRY 

(E) POSTAL CODE 

(A) TELEPHONE: 

(B) TELEFAX: 

(C) TELEX: 

(iii) NUMBER OF SEQUENCES: 

(iv) TITLE OF INVENTION: 



President & Fellows of 

Harvard College 

17 Quincy Street 

Cambridge 

Massachusetts 

United States of America 

02138 



Richard J. Warburg 
Lyon & Lyon 
633 West Fifth Street 
Suite 4700 

Los Angeles, CA 90071 

USA 

90071 

(213) 955-0440 
(213) 489-1600 
67-3510 

24 

DNA POLYMERASES HAVING 
MODIFIED NUCLEOTIDE BINDING 
SITE FOR DNA SEQUENCING 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 



3-5" Diskette, 1.44 Mb 
storage 

IBM PS/2 Model 50Z or 55SX 
IBM P.C. DOS (Version 3^.30) 
WordPerfect (Version 5.0} 



(A) APPLICATION NUMBER: 

(B) FILING DATE: 



NOT YET ASSIGNED 
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(vii) PRIOR APPLICATION DATA: 

Prior applications total, 
including application 
described below: 



TWO 



(A) APPLICATION NUMBER; 

(B) FILING DATE: 

(A) APPLICATION NUMBER; 

(B) FILING DATE: 



08/324,437 
17-OCT-1994 

US NOT YET ASSIGNED 
10 -NOV- 1994 



(2) INFORMATION FOR SEQ ID NO: 1 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: 



14 

amino acid 
single 
linear 



(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 1: 

Arg Arg Ser Ala Lys Ala lie Asn Phe Gly Leu lie Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: 



14 

amino acid 

single 

linear 



(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 
(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 14 
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(B) TYPE: amino acid 

(C) . STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

Arg Asp Asn Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

Arg Asp Asn Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Arg Arg Ser Ala Lys Ala lie Asn Phe Gly Leu lie Tyr Gly 

5 10 . 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

Arg Arg Ser Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 



Arg Asp Asn Ala Lys Ala lie Asn Phe Gly Phe Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 

Arg Asp Asn Ala Lys Ala lie lie Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

Arg Asp Asn Ala Lys Thr Phe Asn Phe Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO : 10: 
Arg Asp Asn Ala Lys Thr Phe Asn Tyr Gly Phe Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQXJENCE DESCRIPTION : SEQ ID NO: 11: 

Arg Asp Asn Ala Lys Thr Phe lie Phe Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

Arg Arg Ser Ala Lys Ala lie Asn Phe Gly Leu lie Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

Arg Asp Asn Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

Arg Arg Ser Ala Lys Thr Phe He Tyr Gly Leu He Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

Arg Arg Ser Ala Lys Thr Phe Asn Phe Gly Leu He Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 
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Arg Arg Ser Ala Lys Ala lie lie Tyr Gly Leu lie Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 

Arg Arg Ser Ala Lys Ala lie lie Phe Gly Leu lie Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 

Arg Arg Ser Ala Lys Ala lie Asn Tyr Gly Leu lie Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Arg Asp Asn Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 

Arg Arg Ala Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

Arg Arg Ala Ala Lys Thr lie lie Tyr Gly Val Leu Tyr Gly 

5 10 

(2) INFORMATION -FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 

Arg Arg Ala Ala Lys Thr lie lie Phe Gly Val Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 

Arg Arg Ala Ala Lys thr lie Asn Tyr Gly Val Leu Tyr Gly 

5 10 
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Claims 

1. Modified gene . encoding a modified DNA polymerase 
wherein said modified gene is modified to increase the 

5 ability of said polymerase to incorporate a 
dideoxynucleotide relative to the corresponding 
deoxynucleotide compared to the ability of the 
corresponding naturally-occurring DNA polymerase by at 
least 20-fold. 

10 

2. The modified gene of claim 1 wherein said 
modified DNA polymerase has sufficient DNA polymerase 
activity for use in DNA sequencing when combined with any 
factor necessary for said DNA polymerase activity. 

15 

3 . The modified gene of claim 1 or claim 2 wherein 
said modified DNA polymerase has sufficiently low 
exonuclease activity to allow said polymerase to be used 
in DNA sequencing . 

20 

4 . The modified gene of any of claims 1 to 3 
wherein said modified DNA polymerase has a 
dideoxynucleotide binding site comprising a tyrosine 
residue. 

25 

5 . The modified gene of claim 4 wherein said 
polymerase is a T7-type DNA polymerase is selected from 
the group consisting of T7, T3, 01, 011, H, W31, gh-1, Y, 
A1122, and SPG. 

30 

6 . A modified DNA polymerase encoded by the 
modified gene of any of claims 1-5. 

7 . The modified DNA polymerase of claim 6 wherein 
3 5 said polymerase is a Pol I -type polymerase modified at an 
amino acid site in a region selected from that 
corresponding to DNA Polymerase I of E. coli in regions 
666-682, 710-755, 755-784, 798-867, and 914-928. 
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8. The modified gene of any of claims 1 co 4 
wherein said modified DNA polymerase is a thermostable 
enzyme , 

5 9 . The modif ied gene of claim 8 wherein said 

thermostable enzyme is selected from the group consisting 
of the DNA polymerase encoded by Thermus aguaticus, 
Thermus thermophllis , Thermus flavus. Bacillus 
sterothermophilus, The2:Tnococcus litoralis, and 
10 Pyrococcus furiosis . . 

10. The modified gene of any of claims 1 to 5 or 8 
and 9 wherein said ability of said polymerase to 
incorporate a dideoxynucleotide relative to the 
15 corresponding deoxynucleotide is increased at least 25- 
fold compared to the corresponding naturally-occurring 
DNA polymerase compared with the corresponding 
deoxynucleotide. 



20 



25 



11. The modified gene of claim 10 wherein said 
ability is increased at least 50-fold. 

12. The modified gene of claim 10 wherein said 
ability is increased at least 100-fold. 

13. The modified gene of claim 10 wherein said 
ability is increased at least 500-fold. 

14. Method for production of a modified DNA 
3 0 polymerase having an increased ability to incorporate a 

dideoxynucleotide relative to the corresponding 
deoxynucleotide compared to the ability of a 
corresponding naturally-occurring DNA polymerase 
comprising the steps of ; providing a nucleic acid 
3 5 molecule encoding a DNA polymerase and mutagenizing said 
nucleic acid molecule to incorporate one or more base 
changes in the nucleotide base sequence at one or more 
sites which alter the ability of said polymerase encoded 
by said nucleic acid to incorporate a dideoxynucleotide 
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by -at least 20-fold. 

15. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of : 
5 incubating a DNA molecule annealed with a primer 

molecule able to hybridize to said DNA molecule in a 
vessel containing at least one deoxynucleoside 
triphosphate, a DNA polymerase modified from a naturally- 
occurring DNA polymerase to have an increased ability to 

10 incorporate a dideoxynucleotide compared to said 
naturally- occurring polymerase, said polymerase having 
sufficient DNA polymerase activity and sufficiently low 
exonuclease activity to be useful for DNA sequencing, and 
at least one DNA synthesis terminating agent which 

15 terminates DNA synthesis at a specific nucleotide base; 
and 

separating the DNA products of the incubating 
reaction according to the size whereby at least a part of 
the nucleotide base sequence of said DNA molecule can be 

2 0 determined. 

16- The method of claim 15 wherein said DNA 
polymerase is a thermostable DNA polymerase and said 
sequencing is performed at a temperature above 50^C. 

25 

17. The method of claim 16 wherein said DNA 
polymerase is a thermostable DNA polymerase and said 
sequencing is performed at a temperature above 6 0°C. 

30 18. The method of claim 17 wherein said 

thermostable enzyme is selected from the group consisting 
of the DNA polymerase encoded by Thermus aguaticus, 
Thermus thermophilis , Thermus flavus, Bacillus 
sterothermophilus , Thermococcus litoralis, and 

3 5 Pyrococcus furiosis . 

19. The method of any of claims 15 to 17 wherein 
said DNA polymerase distinguishes between 

deoxynucleotides and dideoxynucleotides by less than a 
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factor of 100 . 

20. The method of any of claims 15 to 17 wherein 
said DNA polymerase distinguishes between 

5 deoxynucleotides and dideoxynucleotides by less than a 
factor of 50. 

21. The method of any of claims 15 to 20 wherein 
said DNA polymerase has less than 1000 units of 

10 exonuclease activity per mg of polymerase. 

22. A kit for DNA sequencing comprising a modified 
DNA polymerase modified to increase the ability of said 
polymerase to incorporate a dideoxynucleotide compared to 

15 the ability of the corresponding naturally-occurring DNA 
polymerase by at least 20-fbld; and a reagent necessary 
for said sequencing selected from the group consisting of 
dITP, a chain terminating agent, deaza-GTP and ^ a 
manganese -containing compound. 

20 

23. A. method for sequencing a strand of DNA 
comprising the steps of: 

providing said strand hybridized with a primer able 
to hybridize to said strand, to give an hybridized 

25 mixture, 

incubating said hybridized mixture with one or more 
deoxyribonucleoside triphosphates, a modified DNA 
polymerase modified to increase the ability of said 
polymerase to incorporate a dideoxynucleotide compared to 

3 0 the ability of the corresponding naturally-occurring DNA 
polymerase, and a first chain terminating agent, wherein 
said DNA polymerase causes said primer to be elongated to 
form a first series of first DNA products differing in 
the length of the elongated primer, each said first DNA 

3 5 product having a said chain terminating agent: at its 
elongated end, the number of molecules of each said first 
DNA products being approximately j the same for 
substantially all DNA produces differing in length by no 
more than 20 bases, and providing a second chain 
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terminating agent in said hybridized mixture at a 
concentration different from said first chain terminating 
agent, wherein said DNA polymerase causes production of 
a second series of second DNA products differing in the 
5 length of the elongated primer, each said second DNA 
product having said second chain terminating agent at its 
elongated end, the number of molecules of each said 
second DNA products being approximately the same for 
substantially all second DNA products differing in length 
10 from each other by from 1 to 2 0 bases, and being 
distinctly different from the number of molecules of all 
said first DNA products having a length differing by no 
more than 20 bases from that of said second DNA products. 

15 24, Method for sequencing a nucleic acid 

comprising: 

(a) combining an oligonucleotide primer, a nucleic 
acid to be isequenced, between one and four 
deoxyribonucleoside triphosphates, a modified DNA 

20 polymerase modified to increase the ability of said 
polymerase to incorporate a dideoxynucleotide compared to 
the ability of the corresponding naturally-occurring DNA 
polymerase, and at least two chain terminating agents in 
different amounts, under conditions favoring extension of 

25 said oligonucleotide; primer to form nucleic acid 
fragments complementary to the nucleic acid to be. 
sequenced; separating the nucleic acid fragments by size; 
and determining nucleic acid sequence wherein said agents 
are differentiated from each other by intensity of a 

3 0 label in the primer extension products. 

25. An automated DNA sequencing apparatus 

comprising: 

a reactor comprising reagents which provide at least 
3 5 two series of DNA products formed from a single primer 
and a DNA strand, wherein said reagents comprise a 
modified DNA polymerase modified to incifease the ability 
of said polymerase to incorporate a dideoxynucleotide 
compared to the ability of the corresponding naturally- 
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occurring DNA polymerase, each said DNA product of a said 
series differing in molecular weight and having a chain 
terminating agent at one end, separating means for 
separating said DNA product along one axis of the 
5 separator to form a series of bands, the intensity of 
substantially all nearby bands in a series being 
approximately the same, and the intensity of 
substantially all nearby bands in any one series being 
different from those of other series, band reading means 

10 for determining the position and intensity of each said 
band after said separating along said axis, and computing 
means that determines the DNA sequence of said DNA strand 
solely from said position and intensity of said bands 
along said axis and not from the wavelength of emission 

15 of light from any label that may be present in the 
separating means, 

26- A method for in vitro mutagenesis of a cloned 
DNA fragment comprising providing said cloned fragment 

20 and a modified DNA polymerase modified to increase the 
ability of said polymerase to incorporate a 
dideoxynucleotide compared to the ability of the 
corresponding naturally-occurring DNA polymerase, 
contacting said cloned fragment with said polymerase 

25 under conditions for synthesizing a DNA strand from said 
fragment, wherein said conditions cause formation of said 
DNA strand by incorporation of a plurality of individual 
contiguous bases able to base-pair with said fragment and 
incorporation of a nucleotide base unable to base pair 

30 with said fragment. 

27. A method for in vitro mutagenesis of a template 
DNA fragment comprising providing a primer and said 
template, said primer having contiguous bases able to 
35 base-pair with contiguous bases of said template except 
at least one base which is unable to base-pair with said 
template, and extending said primer with a modified DNA 
polymerase modified to increase the ability of said 
polymerase to incorporate a dideoxynucleotide compared to 
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the ability of the corresponding naturally-occurring DNA 
polymerase. 

28. A method for producing blunt-ended double- 
5 stranded DNA from a linear DNA. molecule having a 5' end 

comprising a single-stranded region, wherein the 3* end 
of said molecule is double stranded and has no 3 ' 
protruding termini, comprising incubating said DNA 
molecule with a modified DNA polymerase modified to 
10 increase the ability of said polymerase to incorporate a 
dideoxynucleotide compared to the ability of the 
corresponding naturally-occurring DNA polymerase, wherein 
said polymerase acts on said single -stranded region to 
produce a blunt-ended double stranded DNA molecule. 

15 

29. A method for labeling the 3' end of a DNA 
fragment comprising incubating said DNA fragment with a 
modified DNA polymerase modified to increase the ability 
of said polymerase to incorporate a dideoxynucleotide 

20 compared to the ability of the corresponding naturally- 
occurring DNA polymerase, produced from recombinant DNA, 
and a labelled deoxynucleotide species under conditions 
in which said 3 • end of said DNA fragment is extended by 
said polymerase and thereby labelled by addition of said 

25 labelled deoxynucleotide to said DNA fragment. 

30. A method of amplification of a DNA sequence 
comprising annealing a first and second primer to 
opposite strands of a double -stranded DNA sequence and 

3 0 incubating the annealed mixture with a modified DNA 
polymerase modified to increase the ability of said 
polymerase to incorporate a dideoxynucleotide compared to 
the ability of the corresponding naturally-occurring DNA 
polymerase, wherein said first and second primers anneal 

35 to opposite strands of said DNA sequence with their 3* 
ends directed towards each other after annealing, and 
with the DNA sequence to be amplified located between the 
two annealed primers. 
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31. Thermus aguaticus DNA polymerase having a 
tyrosine at residue 667. 

32. E. coli DNA polymerase I having a tyrosine at 
5 residue 762 . 

33. Pol I type DNA polymerase having a tyrosine 
residue at the analogous location to E, coli DNA 
polymerase residue 762. 

10 

34. The Pol I type DNA polymerase . of claim 3 3 
wherein said location comprises the amino acid sequence 
K Ni 3N 4 N 5 Nfi N7. N Y G/Q, wherein each N is 
independently any amino acid. 

15 

35. A DNA polymerase alpha having the sequence K Ni 
N2 N3 H Ig Y G, wherein each N is independently any 
amino acid, and wherein one said N is mutated to provide 
a polymerase which does not discriminate against ddNMP 

20 incorporation more than 50 fold more than dNMP 
incorporat ion . 

36. Nucleic acid encoding any of the DNA 
polymerases of claims 31-35. 

25 

37. DNA polymerase which in the presence of 
magnesium as the only added divalent cation has an 
average processivity of less than 100 and discriminates 
less than 100 times against incorporation of a ddNMP 

3 0 compared to a dNMP, wherein said polymerase is not 
reverse transcriptase. 

38. DNA polymerase which in the presence of 
magnesium as the only divalent cation has an average 

3 5 processivity of less than 50 and discriminates less than 
50 times against incorporation of a ddNMP compared to a 
dNMP. 



39 . 



DNA polymerase which in the presence of 
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